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This  dissertation  is  concerned  with  the  estimation  of  a matrix  normal  means  in  otre- 
and  two-sample  problems.  In  both  cases  the  empirical  Bayes  method  of  estimation  is  used. 

is  not  necessarily  an  identity  matrix,  while  the  latter  has  three  possible  structures.  The 

matrix.  In  the  one-sample  problem,  the  proposed  EB  estimators  are  uniformly  better  than 
the  sample  mean,  a well-known  mtntamx  estimator,  and  therefore  are  minimax.  In  the  two- 
sample  problem,  the  first  sample  mean  is  a minmax  estimator  of  the  corresponding 
population  normal  means  and  is  dominated  by  proposed  EB  estimators.  In  this  case,  they 
are  weighted  averages  of  the  first  sample  mean  and  pooled  sample  mean,  and  incorporate  in 
a very  natural  way  that  the  degree  of  evidence  for  or  against  the  hypothesis  of  the  equality 

modified  EB  estimator  which  equals  the  pooled  sample  mean  when  the  null  hypothesis  is 


CHAPTER  1 
INTRODUCTION 


Stein  ( 1956),  in  his  seminal  paper,  discovered  the  surprising  result  that  the  sample 
mean  vector  X of  a K (>  2)  dimensional  normal  distribution  Ng(2,  1K)  is  an  inadmissible 
estimator  of  its  mean  2 under  the  squared  error  loss  L(2,  a)  = (fl-fl)T(fi-a),  where  IK  is  the 
identity  matrix  of  order  K.  An  explicit  estimator  dominating  the  sample  mean  was  provided 
by  James  and  Stein  (1961)  under  the  same  setup.  Since  then,  a large  body  of  research  has 

covariance  structure  M of  X and  under  different  quadratic  losses  L](2,  a)  = (S-a)TQ(S-a) 
or  Ljifi,  a)  = (9-a)TQ(9-aVtr(QM).  Where  Q is  a known  KxK  symmetric  positive  definite 

athitraiy  dimension  under  the  L|  loss  for  a known  M or  under  the  L?  loss  for  known  or 
unknown  M.  any  estimator  dominating  the  sample  mean  is  also  a miniamx  estimator  of  2 
Much  of  the  earlier  research  on  this  topic  has  concentrated  on  finding  a general  class  of 
minimax  estimators  in  cases  where  M is  known  and  Q = Jg,  or  M is  known  up  to  a positive 
multiple  and  Q = 1K  or  when  Q = M'1  and  M is  known  or  unknown.  This  research  initiated 
by  Baranchik  (1970),  and  was  culminated  eventually  in  the  work  of  Efron  and  Morris 
(1976a).  For  the  more  general  situation,  Berger  and  Bock  (1976a.  1976b)  and  Shinoaaki 
(1977)  have  found  minimax  estimators  under  the  assumption  that  M is  an  unknown 

more  resistant  to  solution.  Berger  et  al.  (1977),  Gleser  (1979.  1986a,  1986b)  and  Berger 
and  Half  (1983)  have  succeeded  in  developing  estimators  which  dominate  X in  risks  when 


: placed  on  M and  Q.  1 


oice  of  Q. 


Brown  (1975)  and  Shinozaki  (1980)  have  provided  sufficient  conditions  under  which  a 
class  of  estimators  nor  depending  on  Q dominates  the  sample  mean  simultaneously  under 
several  diagonal  Q's  and  the  identity  covariance  matrix.  However,  the  class  of  Q's  for 
which  the  dominance  holds  turns  out  to  be  very  narrow.  For  the  case  that  covariance  matrix 

developed  a wide  class  of  minimax  estimators  not  depending  on  Q.  This  class  of  estimators 
turns  out  to  be  a generalization  of  the  class  of  minimax  estimators  corresponding  to  the  case 

result  of  Ghosh  (1989)  is  stated  below  without  proof. 

Theorem  1.1.1  Let  X - NK(fl,  o2lK)  where  o2  (>  0)  is  an  unknown  constant,  but  an 
estimator  o2,  independent  of  X and  distributed  as  o2(n+2)'1x„,  is  available.  Assume  that 
triQ)  > 2ch,(Q),  where  ch,(Q)  denotes  the  largest  eigen  value  of  Q.  and  t(F)  is  a scalar 
function  differentiable  in  F with  the  derivative  x'fF)  and  F = QjJxyS2.  Suppose 

(i)  0 < r(F)  < 2[trtQ)/ch,(Q)-2); 

(u)  F<K-2wt(F)A2-t(F))1',2<K'2w,M-2)  is  increasing  in  F. 

Then  (1-T(F)/F)X  dominates  X under  the  quadratic  loss  L(8.  a)  = (S-a)TQ(fi-a)  or  L(fi,  a) 
= (2-a)TQ(fi-aVtr(o2Q). 

Stein's  phenomenon  can  be  extended  to  the  problem  of  estimating  a matrix  of 

normal  means.  Suppose  X, Xk  are  K independent  p-dimensional  column  vectors  (K 

>p+l)  such  thatXi  is  normally  distributed  with  mean  vector  fij  and  the  identity  covariance 
matrix  Ij,.  In  this  case,  the  sample  mean  matrix  X = (X|. ....  Xk)  is  a minimax  estimator  of 


Q = (fi|, ....  fix)  under  the  squared  error  loss  L(fl,  a)  = tr((ft-a)T(fi-a)}.  Efron  and  Morris 
(1972b.  1976b).  Stein  (1974),  Haff  (1977.  1979),  Dey,  Ghosh  and  Srinivasan  (1985)  and 
Zheng  (1986)  have  provided  estimators  that  are  uniformly  better  than  X thereby  proving  the 
inadmissibility  of  X-  However,  these  results  are  restricted  to  the  identity  covariance  matrix 
and  squared  error  loss.  For  the  more  general  situation  of  quadratic  loss  and  covariance 
matrix  V of  Xfs,  the  results  thus  far  obtained  arc  very  incomplete.  Efron  and  Morris 
(1972b)  have  considered  the  problem  when  the  p.d.  covariance  matrix  Y is  completely 

matrix  and  completely  unknown  matrix.  Under  the  special  loss  function  Lffl,  g)  = 
tr((a-g)Tr'(fi-»)).  these  authors  proposed  the  matrix  extension  form  James-Stein 
estimator  (lp-(K-p-l)¥.S.-|)X  when  Y is  known  where  S.  = XTX-  The  estimator 
(Ip-fK-p-llYS’^X  was  proposed  when  V is  unknown  and  Y.  independent  of  X.  is  an 

estimate  of  Y- 

For  estimating  the  vector  normal  mean,  Efron  and  Morris  (1972a,  1973a)  motivated 
the  Jamcs-Stcin  estimator  from  an  empirical  Bayes  (EB)  point  of  view.  They  continued  the 
effort  to  motivate  the  estimators  described  in  the  previous  paragraph  as  EB  estimators.  For 
Q = Y = ip.  more  general  EB  estimators  were  provided  by  Haff  (1979)  and  Dey,  Ghosh 
and  Shrinivasan  (1985). 

unknown).  All  the  different  structures  of  the  covariance  matrix  Y given  in  Efron  and  Morris 
(1972b)  are  covered.  The  loss  function  being  considered  is  Lgffl,  a)  = tr|(fi-a)TQ(2-a)) 
when  Y is  known  or  Lg*(&  a)  = tr(  (2-a)TQ’(!i-a)  I when  Y is  known  or  unknown,  where 

q’  « Y"  l,JQ  Y 1/2  and  Y = v}ny}n. 

The  principal  idea  behind  an  EB  approach  is  to  “borrow  strength  from  the 


: should  be  applied 


suspected  of  having  same  aero  prior  mean  and  different  prior  variances.  Efron  and  Morris 
(1973b)  imposed  a distribution  on  the  ratio  of  the  prior  variances  and  derived  a class  of 
compromise  estimators  which  is  superior  than  cither  alternatives  in  terms  of  Bayes  risks 
under  the  squared  error  loss.  Berger  and  Dey  (1983)  have  also  considered  the  problem  of 
combining  groups  when  simultaneously  estimating  normal  means.  Their  situation  is 
substantially  different  from  the  one  in  Efron  and  Morris  (1973b),  since  no  symmetries  and 
EB  relationships  were  assumed  to  exist.  These  authors  applied  two  types  of  estimators  and 

loss.  Another  type  of  grouping  problem  can  be  found  in  Ghosh  and  Sinha  (1988).  They 
considered  the  situation  when  there  were  two  groups  of  observations  but  interest  lay  in 
estimating  the  vector  normal  means  of  the  first  sample.  Die  question  is  whether  the  pooled 
sample  mean  or  the  first  sample  mean  should  be  used  to  estimate  the  first  group  of  normal 
means  when  the  two  groups  of  normal  means  arc  suspected  to  be  equal.  Both  EB  and 
hierarchical  Bayes  (HB)  estimators  were  proposed  and  shown  to  dominate  the  first  sample 
mean  under  the  quadratic  loss.  Unlike  Berger  and  Dey  (1983),  these  estimators  did  not 
involve  the  weight  matrix  Q.  Further,  modified  EB  estimators  were  proposed,  and 

given.  The  PTE  uses  the  pooled  sample  mean  when  the  null  hypothesis  of  the  equality  of 
the  two  population  means  is  accepted  at  a desired  level  of  significance  and  uses  the  first 

The  second  objective  of  this  research  generalizes  the  problem  described  above  into 
the  estimation  of  a matrix  normal  means  when  two  matrix  sample  means  are  available  and 
they  are  suspected  to  have  same  population  matrix  normal  means.  The  proposed  EB  and 
modified  EB  estimators  are  evaluated  in  terms  of  risks  under  the  quadratic  loss  Lglfl,  a)  = 


tr((fi-a)TQ(e-a)l  when  ihe  covariance  matrix  is  known  or  Lg-O.  i)  = tr|(fi-DTQ*(ft*M 


normal  means  in  one-  and  two-sample  problems.  Although,  it  is  known  from  the  previous 
literature  that  Stein's  phenomenon  exists  in  the  problem  of  estimation  of  matrix  normal 
means,  our  study  addresses  the  problem  in  greater  generality  situations.  Three  goals  have 
been  set  and  achieved.  Ftrst,  a general  covariance  matrix  Y (known  or  unknown)  is 
considered.  Second,  we  consider  a general  quadratic  loss  Lg(fi,  a)  = tr((fi-g)TQ(fi-a))  or 
Lg-(2,1)  = tr((fi.-a)TQ*(£-k)|.  unlike  the  previous  work  on  matrix  normal  means 
estimation  where  Q = Ip-  Finally,  the  proposed  estimators  dominating  the  matrix  sample 
mean  do  not  depend  on  the  weight  matrix  Q,  and  hence  they  are  robust  against  the 
misspecitication  of  Q. 

Both  Chapter  2 and  Chapter  3 are  devoted  to  the  estimation  of  matrix  normal  means 
in  the  one-sample  problem.  In  Chapter  2,  we  assume  a completely  known  arbitrary 

known  up  to  a positive  constant  multiple,  second,  an  unknown  diagonal  matrix,  and  third, 
a completely  unknown  matrix.  It  is  assumed  that  in  each  case,  there  is  an  estimate  available 
to  estimate  the  unknown  covariance  matrix  and  the  estimate  is  independent  of  the  observed 
sample  mean.  These  are  discussed  in  Sections  3.2-3.4.  respectively. 

In  Chapter  4 and  Chapter  5,  we  consider  the  estimation  of  matrix  normal  means  in 

stems  directly  from  Ghosh  and  Sinha  (1988).  The  analyses  of  the  first  pans  in  both 
Chapter  4 and  Chapter  5 are  very  much  parallel  to  the  analyses  in  Chapter  2 and  Chapter  3. 


In  nn  EB  framework,  we  propose  a general  form  of  EB  estimator  0EB  (§5®  in  Chapter  4 
and  Chapter  5)  which  bears  a strong  resemblance  to  Baranchik’s  (1970).  Actually  they  are 
exactly  the  same  when  the  dimension  of  the  matrix  normal  means  reduce  from  pxK  to 
lxK.  Two  classes  of  EB  estimators.  §“<'>  and  fP®  (Sf®  and  ftf®),  are  derived  and 
are  shown  to  be  uniformly  better  than  the  sample  mean  in  one-sample  problem  ( or 
unifotmly  better  than  the  first  sample  mean  in  two-sample  problem)  under  the  quadratic 
loss Lgffl.  a)  = tr( (2-a)TQ(6-a)l  or  Lg-ffl. a)  = tr((fi-a)TQ'(e-a)}.  We  have  also  shown 
under  certain  conditions  a88®  (ftf80*)  is  improved  by  fP®  (If®)-  Since  the  matrix 
extension  form  of  James-Stein  estimator  belongs  to  the  first  class,  our  result  shows  its 

own  positive-part  version)  has  been  found  to  dominate  the  James-Stein  estimator  for  the 
estimation  of  vector  normal  means. 

In  Sections  4.3  and  5.3.  the  problem  of  how  to  reach  a compromise  between  the 
first  sample  mean  and  pooled  sample  mean  is  investigated.  In  problems  of  this  type,  what 
is  normally  sought  is  a compromise  estimator  which  leans  more  towards  the  pooled  sample 
mean  when  the  hypothesis  of  the  equality  of  the  two  population  means  is  accepted,  and 
towards  the  first  sample  mean  when  such  a hypothesis  is  rejected.  One  classical  way  to 
achieve  this  compromise  is  to  use  the  PTE,  although  the  procedure  suffers  from  the 
drawback  that  there  is  no  general  way  of  incotporating  the  degree  of  evidence  for  or  against 
the  null  hypothesis  in  order  to  anive  at  the  estimator.  For  instance,  if  the  observed  statistic 

no  persuasive  reason  for  choosing  one  estimator  in  preference  to  the  other.  However,  the 
EB  minimax  estimators  (a™1*,  gf®  and  Sf8®).  proposed  in  Sections  4.2  and  5.2 
incorporate  the  degrees  of  evidence  for  or  against  the  null  hypothesis  in  a very  natural  way. 
Further,  their  modified  versions  (fi}®8®.  gj®8®  and  Bp8®)  are  shown  to  dominate 


a.  ll  equals  the 


ihc  corresponding  PTE  in  terms  of  risks.  Such  a modified  EB  minimax  estima 
to  be  a weighted  average  of  the  first  sample  mean  and  the  pooled  sample  mean 
pooled  sample  mean  when  the  hypothesis  is  accepted  at  a desired  level  of  significance,  and 
equals  the  EB  minimax  estimator  when  the  opposite  is  the  case. 

Numerical  studies  are  undertaken  in  Section  2.4  and  Section  3.5  to  illustrate  the 
applicability  of  the  estimators  proposed  in  Section  2.2-2.3  and  Section  3. 2-3. 4. 

study  are  given  in  Chapter  6. 


which  have  been  found  to  be  most  useful  are  Stein's  (1974)  integration-by-parts  identity 

difference  for  competing  estimators  of  Q and  Wishart  identity  due  to  Haff  (1982).  A scalar 
version  of  the  Wishart  identity,  usually  referred  to  as  the  chi-square  identity,  appeared  in 
Efron  and  Morris  (1976a).  These  identities  are  now  given  below. 

Lemma  1.3.1  (Stein's  identity)  Suppose  , a pxl  column  vector,  has  a multivariate 
normal  distribution  Np(fi,  try  and  hQ£)  is  differentiable  with  respect  to  Xt,  i = 1, ....  p. 


(1.3.1) 


, Let  w(v)  be 


Lemma  1.3.2  (Chi-square  identity)  Suppose?  is  distributed  as  v(n+2)''zn,  wl 
is  a constant  and  zj  is  the  chi-square  distribution  with  n degrees  of  freedom, 
differentialbe  in  v and  w'(v)  denote  the  derivative  of  w(v).  Then 

Hy[w(v)vv-1]  = Ev[(5^w(?)+5;^5w,(v)v).  (1.3.2) 

Lemma  1.3.3  (Wishart  identity)  Assume  that  S = «Sij)).  a pxp  matrix,  has  a nonsingular 
Wishart  distribution  with  matrix  £ and  K degrees  of  freedom,  i.e.,  £ - W(p.  K,  2),  where 
K > p+1.  Let  E>s  = ((d?F))  and  df  = 1/2(1+8^3/38^,  Sy  is  the  Kronecker  delta,  i.  j = 

1 p.  Then  for  a suitable  pxp  matrix  W = W(£),  we  have  the  Wishart  identity 

EjJT'MS)]  = Es12QsW(S)+(K-p-l)S'lW(S)l.  (1.3.3) 

(1974)  and  Haff  (1979). 


CHAPTER  2 

EMPIRICAL  BAYES  MINIMAX  ESTIMATORS  OF  MATRIX  NORMAL  MEANS 
FOR  ARBITRARY  QUADRATIC  LOSS  AND  KNOWN  COVARIANCE  MATRIX 
IN  THE  ONE-SAMPLE  PROBLEM 


2.1  imroducupn 


The  problem  of  estimation  of  matrix  normal  means  can  be  stated  as  follows:  Let 
Xi. ....  Xk  be  K independent  p-dimensional  column  vectors.  K > p+1,  with  Xi  having 
conditional  distribution 


Xilfii-Npfai.Y),  (2.1.1) 

we  will  consider  an  unknown  V.  Let  fl  = (fi, Sk).X=(Xi Xr).  The  problem  is 

(MANOVA)  model 

ij=a  + fiij(i  = l K:  j = 1 m),  (2.1.2) 

where  Xjj  is  a p x 1 vector  of  responses  of  the  jth  individual  receiving  the  ith  treatment,  and 
8,  is  the  p x 1 vector  of  ith  treatment  effects.  The  vectors  Sij's  are  independently  distributed 
as  Np(Q,  Yfl),  where  Yo  is  a known  pxp  symmetric  p.d.  matrix.  The  minimal  sufficient 

statistic  for  2 = (fit Sr)  isX  = (Xi Xk).  where  Xi  - m1  Z Xij-  Note  that  the 

Xi's  are  independendy  distributed  as  NplBi,  Y),  where  Y = m‘'Yo  and  K > p+1. 

In  a Baysian  framework,  (2.1.1)  is  interpreted  as  the  conditional  distribution  of  Xi 
given  8|.  Suppose  also  that  Si's  have  independent  distributions 


(2.1.3) 


Si  i Xi = a - NpfOp-yjr'jsi.  dp-yr'jy),  (2.1.4) 

where  Ip  is  the  idendiy  matrix  of  order  p and  £ = (Y+A).  Under  the  quadratic  loss 

Lgffl.  1)  - S (Brai)TQ(Bi-ai).  (2. 1.5) 

where  Q is  p.d.,  the  Bayes  estimator  of  Q is  given  by 

(Pw  - Op-yr'ix.  (2.i.6) 

In  the  EB  scenario.  A and  S are  unknown,  and  arc  estimated  from  the  marginal  distribution 
of  H.  Note  that  the  marginal  distributions  of  2fc's  are  mutually  independent  with 

2fc~Np(Q,2),  (2.1.7) 

and  S = £2(&T  = XXT‘s  complete  sufficient  for  £.  where  S~  Wishart  W(p.  K.  2).  A 

natural  candidate  for  the  choice  of  2'1  is  (K-p-l)S'1.  the  best  multiple  of  S'1.  More 
generally,  one  can  consider  the  class  of  EB  estimators 

aEB(2Q  = (Ip-YS',«YSM  (2.1.8) 

of  fi . where  1(Y.  S)  is  a pxp  symmetric  matrix.  Since  V is  known,  the  appearance  of  Y in 
l(Y  £)  is  superfluous.  But  we  prefer  to  continue  with  this  notation,  since  later  for 
unknown  Y we  will  use  «Y,  S).  where  Y estimates  Y 

Estimation  of  the  matrix  normal  means  was  also  considered  in  Efron  and  Morris 
(1972b,  1976b),  Stein  (1974),  Haff  (1977,  1979),  Dey,  Ghosh  and  Srinivasan  (1985), 
and  Zheng  (1986)  among  others.  Efion  and  Morris  (1972b)  showed  that  (Ip-(K-p-l)S’')Y 
dominated  2t  uniformly  in  8 under  the  loss  (2.1.5)  with  Q = Ip,  i.e.,  the  squared  etror  loss. 


and  Y=Ip.  Later,  Stein  (1974)  and  Efron  and  Morris  (1976b)  proposed  estimators  of  the 
form  Op-afi-'-Wtr  S)''lp)X  for  fi  which  dominate  X for  certain  choices  of  the  constants  a 

0 under  the  quadratic  loss  (2.1.5)  for  every  p.d.  Q,  any  estimator  dominating  X is  also 

Theorem  2 (page  75)  generalized  Baranchik's  (1970)  results  on  minimax  estimation  of  the 
vector  of  normal  means. 

It  will  be  shown  in  Section  2.2  that  EB  estimators  of  the  form  (2.1.8)  dominate  X 
under  the  loss  (2.1.5).  Thus,  the  results  in  this  section  generalize  the  previous  results  in 
two  directions.  First,  V is  not  necessarily  Ip.  Second,  the  loss  is  the  more  general  (2. 1.5) 
with  an  arbitrary  Q instead  of  Q = lp.  We  shall  see  that  the  problem  of  finding  estimators 
which  dominate  X under  the  loss  (2.1.5)  for  every  p.d.  Q amounts  to  the  problem  of 
finding  solutions  to  differential  inequalities  of  a certain  type.  As  a special  case  (Theorem 

(21.5)  when  Y is  not  necessarily  lp.  However,  other  minimax  estimators  will  be  proposed 
where  conditions  required  in  Theorem  2 of  Zheng  (1986)  are  not  met.  Some  of  these 
estimators  are  different  from  the  ones  proposed  in  Haff  (1979)  and  Dey,  Ghosh  and 
Srinivasan  (1985). 

It  will  be  shown  in  Section  2.2  (Example  2.2.1)  that  EB  estimators  of  the  form 
(Ip-aYS.'')X  dominate  X for  every  0 < a < 2(K-p-l)  under  the  loss  (2.1.5).  The  best 
choice  of  a is  K-p-1.  Stein  (1974)  and  Efron  and  Morris  (1976b)  provided  estimators 
dominating  (Ip-(K-p-  1)YS'‘)X  when  Y = IP  and  the  loss  was  squared  error.  Under  the 
same  set  up,  Haff  (1979)  provided  estimators  dominating  Op-aS'^XforO  <aSK-p-l. 
We  provide  in  Section  2.3  several  estimators  which  dominate  dp-r(Y.  £)YS‘')X  for  0 < 
r(Y,  S)  < K-2.  Again,  the  previous  results  arc  generalized  in  two  directions,  i.e.,  Y is  not 
necessarily  lp  and  the  loss  is  (21.S)  instead  of  squared  eiror  loss.  Moreover,  it  will  be 
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encouraging  10  find  (hat  some  of  the  esiimalors  for  S"1  proposed  by  Haff  (1979)  or  Dey, 
Ghosh  and  Srinivasan  (1985)  can  be  used  for  £-lKY,  S)  in  (2.1.8)  and  those  estimators 
dominate  (Ip-aVS'1)* for  0 < a < K-2  under  the  loss  (2.1.5). 

In  Section  2.4,  we  have  provided  the  simulated  risks  of  five  different  estimators  of 
a under  the  loss  (2.1.5)  divided  by  tr|Q  V)  for  different  choice  of  p,  K,  fi.  y and  Q. 
It  turns  out  that  the  estimator  5,^0)  = (Ip-Xl(l-Cto)[(K-p-l)a-^(p-l VtrlS))!,,] 
+a0[(Kp-2)/tr(fi)Up])X  with  ao  = (K-p-lW[K2(p-D)  has  a definite  edge  over  its  rival 
estimators  when  fl  = Q.  However,  for  fi  far  away  from  Q,  among  the  estimators  dominating 
K.  there  is  no  clear-cut  advantage  of  any  particular  estimator  over  the  others.  Thus,  on  an 
overall  basis,  we  recommend  the  use  of  &ag(K'l  as  an  estimator  of  B- 


We  fust  prove  a result  which  connects  the  risk  of  X with  the  risk  of  fiEB(X)  defined 

estimators  of  the  form  (2.1.8).  We  denote  by  Eg,  the  expectation  conditional  on  fi,  E the 

distribution  of  X and  fi.  Also,  let  Rg(fi,  s(X))  = E2(Lg(fi,  E(X))].  where  Lg  is  defined  in 
(2.1.5). 

Theorem  2,2.1  Under  the  Lg  loss  given  in  (Z1.5),  fi68  defined  in  (2.1.8)  has  risk 
RgCfi.fi88)  = Rg(fl.X)  + Efltr)[S'li(Y,S)Sl(Y.5)S'1  S)S'‘l 


-2(K-p-l)l(Y,S)S'1]Q0), 


(2.2.1) 


where  Qo  = VQy  and  Elisas  defined  in  Lemma  1.3.3. 
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PtraLoOlisorcin  2.2.1  La  iff  00  = dp-ir'lS  and  eP(X)  - (Ip-  yS‘‘lQL  S))&.  i - 
1 K.  Then. 


- E{|(Sl-SP)TQ(fii-flP)) 

- Ed  (a-aP4P-a,EB)TQ(9,-aP4f^P)] 

= ELgffl.  a8)  -f  e(  i (2f -aP)TQ(fif-aPi  i 

= ELgca,  ae) + Etr((&-|icy.s)-rl)sci<i'.  s)a‘'-s'')Qo ) • (2.2.2) 


Next,  using  the  fact  that  Si  1 Xi  has  distribution  (2.1.4).  one  has 

HLgffl.  fi8)  = KtrKIp-Yr'nQ).  (2.2.3) 


Etr(  (s‘i(y.  a)-£-‘)S(ia.  sjs'-jt'jQo) 

= Etr([s-'i(i,  SISKY,  S)S‘'-2£-'£l(X  S)S'1+£'lS£‘,]Qol 
- Etr(  [S'lCY,  S)Sl(i  S)S‘l-2£-'SlO',  S)£‘l]QohKtr(irlQo).  (2.2.4) 
To  obtain  (2.2.4),  note  that  iriff'SKY,  S)S"'Qo)  = tri£''5l(Y.  S)S’‘Qo)t  = ttfQoS'1 
lOL,  S)S£'')  = tK£'ll(y,S)S£'lQo).  and  use  E(S)  = K£. 

Fxom  (2.2.2),  (2.2.3)  and  (2.2.4).  one  has 


ELg&fff8) 


= Kt*YQ)+Eir{  [s'KY,  S)SKV,  S)S’‘-2S-‘SKy,  SJS'lQo)  • 


(2.2.5) 


Note  that  Rg(9.  2£)  = EgLglg,  20  = KutVQ)  for  all  9.  Using  the  Wishart  identity  as  given 

E{£-‘5i(Y,5)S4) 

= E(2qS|Si(Y,S)S'i]-KK-p-1)S-1Si(V,B)S‘1I 

« EtfB*  [SKY,  £)£‘'l+(K-p- 1 )I(i.  S)S‘‘ )-  (2.16) 

It  follows  ftom  (2.2.5)  and  (2.2.6)  that 

ELg&.iD3) 

= ERgOta68) 

= ERg(2, 20+EEatr([S-‘l(V,  S)Sl(X  2)5''  ^ISiO,  S)S''l 

-2(K-p-Ul(Y,S)S1]Qo).  (2.2.7) 

The  conditional  distribution  of  5 = £ given  9 depends  on  9 only  through  £ 9j  9^  = 

aaT  =S(say).  Also,  the  family  of  distributions  of  Q being  Wishart  lV(p,  K.  A)  is 
complete  for  A (or£).  Thus.  (2.2.1)  follows  immediately  from  (2.2.7). 

Remark  2.2.1  The  technique  of  proof  used  in  Theorem  2.2.1  is  borrowed  from  Elton  and 
Morris  (1976b)  (sec  their  Theorem  2).  They  compute  the  risk  of  a special  choice  of  9® 
when  X = Q = Ip.  The  more  general  (2.2. 1 ) is  available  because  of  the  powerful  Wishatt 


identity  given  in 


i 1.3.3. 


Next,  we  will  consider  two  classes  of  EB  estimators  of  the  form  (2,1.8),  first, 
fiEB<l)  = Op-rOt  £)Y£-')20  so  that  1(V.  £)  = KY,  S)Ip,  Where  rCL  S)  being  a scalar  in 
(2.1.8).  Second.  §P(2)  = Up-Y£''!S(£))21,  so  that  l(Y,  £)  = fflfS)  in  (2.1.8),  where  s(£) 
is  symmetric,  a function  of  £ only  and  depends  on  2£  only  through  £.  It  is  shown  in 
Theorem  2.2.2  and  Theorem  2.2.3  that  SEB(,)  and  fiP®  dominate  21  under  the  loss 
(2.1.5)  and  under  certain  conditions  on  r(V,  £)  and  a(£).  respectively. 

Corollary  2.2.1  Under  the  Lg  loss. 


Rg(£  20  + Efltr ( [?0L  £)-2(K-p- 1 IKY,  £)]£'* Q0-  4[dS  i(Y,  £)|Qo  ) . (2.19) 


directly  from  (2.11). 

In  view  of  (119),  an  unbiased  estimator  of  the  risk  difference  Rg(£, 
Rq(S.  20  is  given  by 
hg’Gd.S) 

= l([r2<y.S)-2(K-p-I)i(Y,£)]£-|Q0.4a^r(Y,£)lQB) 

= <r(Qg2[(’2QL  S)-2(K-p-  l)t(V,  £))£-■- 4 (fiS  tfV,  £))]Q^/2) , 


flEB<l>  = (]pT(Y,£)Y£-|)2i. 


(2.2.8) 


(12.10) 


i Corollary  2.2.1 


where  Qo  = qU1^12  sincc  9#* is  symmetric  and  p.d..  Ii  follows  from 
thai  10  find  EB  estimator  aEB,l)  improving  on  X,  it  suffices  to  find  solutions  for  h(g)(Y,  £) 
£ 0 with  Pa(h(^(y,  S)  < 0)  > 0 for  aU  B.  Using  (2.2.10),  it  suffices  to  find  solutions  for 
the  matrix  inequality  H^O. S)  £Q.  where 

H mCL  5)  = [r 2QL  S)-2(K-p-l)r(V,  r(V,  S)).  (2.2.1 1) 

Recall  that  we  write  a matrix  I £ Q if  I is  non-positive  definite  (n.p.d.)  and  I £ Q if  I is 
non-negative  definite  (n.n.d.).  The  following  theorem  provides  a set  of  sufficient 
conditions  under  which  SEB,I)  will  dominate  X. 

Theorem  2.2.2  Let  £ = «Sy>)  and  r (V,  S)  be  a differentiable  function  with  respect  to  Sjj 
for  each  i,  j=  1 p,  then  under  the  loss  (2.1.5).  Rg(fi,  (P'1’)  < Rgffl,  X)  if 

(i)  0<r(Y.S)<2(K-p-l); 

(ii)  ES  ro,  S)  is  n.n.d„ 

Proof  of  Theorem  2.2.2  Since  £ is  pti.,  assumptions  (i)  and  (ii).  and  (2.2.1 1)  imply  that 
UWCL  S)£Q.  Hence  (2.2.10)  is  negative.  Therefore,  the  proof  of  Theorem  2.2.2  is 


Consider  the  special  case  r(X.  £)  = r(F),  where  F = WS.V'1).  The  following 
corollary  to  Theorem  2.2.2  shows  fi68*1*  dominates  X under  the  loss  (2.1.5)  and 
conditions  similar  to  those  proved  in  Baranchik  (1970)  for  EB  estimators  of  vector  normal 
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Corollary  2.2.2  LetriYS)-  r<F).  where  F = triXY’1),  and  r(F)  is  differentiable.  Then 
under  the  loss  (2.1.5),  I0*®  dominates  X if 

(i)  0<r(F)<2(K-p-l); 

(ii)  r(F)  is  non-decreasing  in  F. 

Proof  of  Corollary  2.2.2  Let  Y'1  = «v*i».  In  view  of  condition  (ii)  in  Theorem  2.2.2. 
DVF)  = r'fFKQSF)  = r'OOIMfF))  and  df.F  - 1/20+8^)  1 5kn,vmk)  = A tys 

being  the  Kronecker  deltas.  It  follows  from  assumption  (ii)  that  D^r(F)  = t'(F)-y1  2 11. 
The  proof  of  the  corollary  is  complete. 

Remark  2.2.2  It  is  important  to  observe  that,  when  p = 1,  Corollary  2.2.2  reduces  to  the 
results  in  Baranchik  (1970)  for  independent  normal  random  variables  which  have  a 
common  known  variance.  His  results  on  unknown  covariance  will  be  extended  to  the 
matrix  case  with  unknown  covariance  matrix  v in  Chapter  3. 

A matrix  extension  of  the  James-Stein  (1961)  estimator  is  given  in  the  next 
example. 


Example  2.2.1  Suppose  riV,  S)  = a is  constant.  Following  (2.2.9),  one  gets 

Rq(2,  aEB<1))-RQ(9.X)-  Efltr(a(a-2<K-p-l)JQ"3S',Qi'2|  • (2.2.12) 


Hence,  the  EB  estimator  (Ip-aYS^lX dominates  X.  if  0 < a < 
Op-OC-p-DYS'1)  X 


: 2(K-p-l).  The  optimal 


,F®  We  will  propose: 


several  »(fi)  for 

F®  such  that  F®  is  bener  than  2t  under  the  Lq  loss.  The  risk  Rg(SL.  F2’)  of 
F®  is  provided  below. 


Corollary  2.2.3  Under  the  Lg  loss. 


F® = tip-y  s'ffi<s))x.  <2.2.13) 


Rg(£.F®) 

Rg<a  20  + Hgirl  [S'1!S(S)S!I1<S)S'I-4DSISS!<S)S'I1 
-2(K-p-l)jj(S)S',]Q0)  (2.2.14) 


KY.fi)  wiihfflCS). 


In  view  of  (2.2.14),  an  unbiased  esdmaior  of  die  risk  difference  Rg(fi,  F®>- 
Rg(fi.  20  is  given  by 
h^QLS) 

= Eatr{  [S'lffl(S)fiffi(S)S'1-4I2slS!I!(S)S'll-2(K-p-l)i|Kfi)S'1]Q)) 

» Efltr{Qi,2[S-|a(S)Sa!(S)fi’'-4D5lfiaKS)fi  l]-2(K-p-l)iB(S)S-1]^'2).  (2.2.13) 
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Note  that  fl™®  dominates  Xif  h<g,(V,S))  £0  with  Pfi(h^Q£,  £)  < 0)  > 0 for  all  0.  From 
(2a.  15),  it  suffices  to  show  the  matrix  H®(S)  £ 0.  where 

HC)(S)=S'lst(S)Sffi(S)S'l-4Esisa(S)S'l]-2(K-p-i)a(s)S‘1.  (2.2.16) 

Consider  the  particular  choice  ffi(S)  = R!E(20ET.  where  J, » (X] Xp)T  denotes 

the  vector  of  eigen-values  of  &,  and  ‘F<Xi  is  a diagonal  matrix  with  ilh  diagonal  element 

equal  to  >Fi(D  > 0,  i = 1 p.  Also,  let  E be  an  orthogonal  matrix  with  its  column 

vectors  equal  to  the  onhonormal  eigen-vectors  of  S,  Since  £ = EAEX.  where  A is  a 

(2.2.16)  that  with  the  given  choice  of  ffi(S), 

H<2'(5)  =ES2(i)A'1  ET-4QsiE!E(i)ETl-2(K-p-l)E2'(i)A'1  ET-  (2.2.17) 
From  Haff  (1984).  one  gets 

DS[E!E&).Et)  =E^,,(i)ET.  (2.2.18) 

where  is  a diagonal  mauix  with  ith  diagonal  clement  equal  to 

'if’O)  = J , £.PFi(i)-'Fj(i))/(Xi-Xj)+a4'.a)/3>H.  (2.2. 19) 

From  (2.2.16)  and  (2.2.17),  H®(£)  £ Q is  equivalent  to  U(A)  £ o,  where  u(A)  is  a diagonal 
matrix  with  ith  diagonal  element  equal  to 

Uj(D  = 'F20)X:l-4'F51|(i)-2(K-p-l)4'ia)).:1.  (2.2.20) 

In  the  remainder  of  the  section,  we  find  solurions  of  Uj&)  < 0 for  each  i = 1 p. 

obtaining  thereby  a class  of  EB  estimators  dominating  X 
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The  following  theorem  shows  that  Zheng's  (1986)  Theorem  2 continues  to  hold 
under  the  general  loss  Lg  and  variance  matrix  V.  Zheng  (1986)  proved  the  result  for  Q = Y 

-v 

Theorem  2.2.3  Suppose  that 

(i)  0<4'iQ)<2(K-p-l)  for  each  i-1 p; 

(ii)  3'l'ift)/3Xi  2 0 for  each  i = 1 p; 

(iii)  H'l(2h)'s  are  similarly  ordered  with  the  Vs,  that  is, 

(T'j(D-1'j(i))(V^j) 2 0 for  every  1 £ i,  j £ p. 

Then  Uj&)<0  for  each  i=  l p. 


using  assumption  (i).  The  proof  of  Theorem  2.2.3  is  complete. 

We  provide  two  examples  illustrating  the  use  of  Theorem  2.2.3. 

Example  2.2.2  Stein  (1974).  and  Efron  and  Morns  (1976b)  have  shown  that  if  £'‘ffi(S)  = 
aS‘l-Kb/tr(£))ip,  for  V = Q = ^ then  (^-S'  'ffitSBX  dominates  X for  certain  choices  of  the 
constants  a and  b.  Note  that  one  can  write  £’'®(£)  = EA'1  2KDET,  where  A'1  SKI)  is  a 
diagonal  matrix  with  ith  diagonal  element  equal  to 


, T,‘l|(i)  a 0 for  each  i 


Uj(A)  £ TjOJI-FiOJ^dC-p-Dl?,:'  <0, 


(2.2.21) 


x:1'Pia)-x:l(a+bV.ixj). 


(2.2.22) 


Then  if  a a 0,  b i 0 and  0 < a+b  < 2(K-p-I),  all  the  three  conditions  of  Theorem  2.2.3  are 
met  with  probability  1.  Consequently  for  such  a pair  (a.  b),  the  estimator 

( Ip-y(aS' 1 +(Wti<S))lpl ) X dominates  X for  every  Lg  loss  given  in  (2.1.5).  In  particular,  if 
a = K-p- 1 . then  for  every  b e [0,  K-p- 1 ),  the  dominance  over  X holds.  Also,  if  b = 0.  then 
Gp-aYS'1)  X dominates  X for  evety  a e (0, 2(K-p-l)). 


Example  2.2.3  Suppose  now  ST'fflffi)  = (K-p-l)£’l+(h/tr(£2)]£  with  b 6 [0,  K-p-1). 
Then, 

3'1(D  = (K-p-l)+bX?/£x2.  (2.2.23) 


It  is  easy  to  verify  conditions  (i)-(iii)  of  Theorem  2.2.3  for  this  particular  choice  of  3(G), 
and  accordingly,  Gp-YS‘1Sl(S))X  dominates  X for  every  Lg  loss. 

It  is  possible,  however,  to  obtain  estimators  of  the  form  (2.Z13)  which  dominate  X 
even  though  the  conditions  of  Theorem  2.2.3  are  not  met.  Some  examples  follow. 

Example  2.2.4  Consider  the  case  when  b,  c and  d are  constants, 

3(&)  = b(l+cUd,)'1Jp  (2.2.24) 


and 


(2.2.25) 


is  the  geometric  mean  of  the  eigen-values  of  £.  Then,  condition  (ii)  of  Theorem  2.2.3  is 


22 


u,(i)  = Xj1(l+cU^)‘Ib[b-2<K-p-l)-2cU^(K-p-l-2dp‘1)]  < 0 (2.2.26) 


if  (a)  0 < b < 2(K-p-l),  (b)  c > 0.  and  (o)  0 £ d £ p(K-p-l )/2.  Thus,  Op-b(l+cU,|)-|YS'1) 


i in  (2.1.5)  if  (a)-(c|  hold. 


Example  2.2.5  Suppose  Urn  time 

TOO  - bfl+cU^-'lp  (2.2.27) 

U2=£vp  (2.2.28) 

is  the  arithmetic  mean  of  the  eigen-values  of  S-  Once  again,  condition  (ii)  of  Theorem 
2.2.3  is  violated,  but 

U|(i)  = 1 ( 1 «U  j)'2  b(b-2(K-p- 1 )-2cUj(K-p- 1 -2dXi(pU2)‘1)] 

£ Xj 1 ( 1+cU^)"2  b(b-2(K-p- 1 )-2cU2<K-p- 1 -2d)|  < 0 (2,2.29) 

if  (a)  0 < b < 2<K-p-l),  (b)  c > 0,  <c)  0 £ d £ (K-p-l)/2.  Thus.  Op-Ml+cU^-'YS'bX 
dominates  X for  evcty  loss  given  in  (2.1.5)  if  (a)-(c)  hold.  One  can  also  use 

mi  = b(l+cUj)*Il[),  (2.2.30) 

where 


us-«£ixiV'. 


(2.2.31) 


the  harmonic  mean  of  ihe  eigen-values  of  £.  Then,  flp-b<l+cU^)'1XS"l)2i  dominates  2£  for 
every  L<j  loss  under  conditions  (a)-(c)  of  the  example. 

Example  2.2.6  We  consider  now  a class  of  estimators  fs'ictsila  of  21" 1 , where 

IS-'ffi(S))a  “ ( 1 -a)[(K-p- 1 )S"  ‘-.(WtrfSJiy  ■HXl((Kp-2)/tr(S))Ipl . (2.2.32) 

OS  a S 1.  An  alternate  representation  of  ( £' 'a(S)  1 a ‘5  E A' 'lECi) ET.  where  £&)  is  a 
diagonal  matrix  with  ith  diagonal  element  equal  to 

T-iO)  - avb„V  £Xj,  (2.2.33) 

r i 

where  a = (l-ot)(K-p-l),  bp  = (l-a)b+o(Kp-2). 

Note  that  from  Example  2.2.2.  (^-V  (£'lffi(S)la)!<  dominates  H for  all  Lq  if  0 < 
a+bo  < 2(K-p-l).  i.e..  0 < bo  < (l+«)(K-p-l)  or  0 < b < ((l+o)(K-p-l)-a(Kp-2)]/(l-o). 
Note  also  that  the  above  holds  only  if  a 6 (0,  (K-p-l)/I(p-l)(K+l)l). 

Remark  2.2.3  Efron  and  Morris  (1976b)  chose  b = p2+p-2  and  showed  the  dominance  of 
the  corresponding  Q^-  (£'1ffl(£))0)X  over  X under  the  squared  error  loss  and  V = 1^.  We 
do  not  have  such  a strong  result  regarding  dominance  of  (1^-V  (S"1a(£)lc,)X  over  X 
simultaneously  for  all  Lq.  However,  we  shall  see  from  Section  2.3  that  a very  appropriate 
choice  of  b is  p- 1 . With  this  particular  choice  of  b,  it  follows  from  (2.2.20)  that 

Ui&)  - (a2-2(K-p-l)a)Xjl+(2abo-2(K-p-l)bo-2(p-l)ho-dbo)/(^j) 
+b^(|xj)2^b0ty(J,Xj)2 
S (a2-2(K-p-l)a-2(K-2-a)bo+bJl/(  £Xj). 


(2.2.34) 
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Note  that  the  right  hand  side  of  (2.2.34)  is  negative  for  a € [0,  (K-p-l)<K2(p-l)) 1 
(1 +[  1 +K2(K-p- 1 )‘2((K-p-  1 )2+(p- 1 )2)] 1/2 ) ] , and  the  function  is  minimized  at  <x0  = 
(K-p-l)/[K2(p-l)].  We  propose,  therefore,  the  estimator 

SaoQO 

= 0p-Yl(l-cto)[(K-p-l)S-1+((p-l)/trlS))lp]'«Xo((Kp-2)/tr(S)lIpl)X.  (2,2.35) 

2.3  immovement  On  (lr-r(V.  S)Y  £'')X 

Consider  §P(I,(X)  = Op-rQ(,  SJii  where  rQL  S)  is  differentiable  with 

respect  to  Sjj  for  each  i.  j = 1 p.  and  let  P<3,(X)  = Up-rtY,  £)Y£'1-YS'1£(£))X. 

where  £(£)  = E3£(i)  ET.  !E(i)  is  a diagonal  matrix  with  ith  diagonal  element  equal  to 

3*  j&)  > 0,  i = 1 p,  as  before  E is  an  orthogonal  matrix  with  ith  column  vectors  equal 

to  the  orthonormal  eigen-vectors  of  £,  and  X * (3.j , ....  Xp)  denotes  the  vector  of  eigen- 
values of  5.  Then  from  (2.2. 1),  one  gets 

Rglfi,  fiEB(3)(X))-Rg(&.  S^fX)) 

- Egtr{  [S  ,f<S>Sf<SJS'1+rCYL,  S)IS'1£(S)+£(a)S-,]-4I^)S£(S)S-1l 
-2(K-p-l)£(S)S’l]QB) 

- Hatr(Qj,JE(ai2a)A-|-2(K-p-l-r(Y,S))m)A-|-4S(l,a)JETQj'2)  <0, 


forallfiif  Aoi(i)  <0  for  each  i = l,  ...,p,  where 

AffiO)  = 3'fa)X:‘-2(K-p-l-r(y,S))'l'i(DVi1.2  E mOJ-WjOJl/fti-Xj) 


-4[34'i&)/3Xi|. 


(2.3.1) 


> 4oj(D  < 0 for  each  i = 1, 


Theorem  2.3.1  Let  0 < rfY,  S)  < (K-2)  and  tQ£,  5)  is  differentiable  with  respect  10  Sy  for 

each  i.  j = 1, ....  p.  Suppose  1'iQJ  = Xjt(Uy(£Xj),  i = 1 P.  where  U is  a function  of 

j-l 

the  Vs.  Suppose  that 

(i)  UO)  is  non-decreasing  in  X*  for  each  i =*  1, ....  p; 

(ii)  t(U)  is  non-decreasing  in  U; 

(iii)  0 < tfU)  < 2(K-r(y,  S)-2). 

Then  %&)  < 0 for  each  i = 1 p.  Hence.  fiEB<3)(X)  dominates  0EB(I,(X)  under  the  Lg 

%(D  = Xit2(U)/(ixj)2-2(K-p- 1 -r(X  S))t(U)/(  £Xj>-2(p- 1 )t(UV(  £x,) 
j=l  j.1 J j-l 

^[tcuy^-Xjtfuvf iv^V.^t'fuxau/av] 

= [t2(U)+4t(U)]Xi/(^l2-2(K-rO'.  S))t(Uy(£xp 
-4(V.iXj)t'(U)OU/3Xi) 

£ [t2(U)^I(U)]V(^)J-2(K-r(y,S))l(W(£v 
S [t2(U)-2(K-rQ'.  S)-2)t(U)l/(  £Xj)  < 0. 


(2.3.2) 
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The  first  inequality  in  (2.3.2)  uses  parts  (i)  and  (ii)  of  the  assumption,  while  the  last 
inequality  uses  part  (iii).  The  proof  of  the  theorem  is  complete. 

Example  2.3.1  It  follows  as  a special  case  of  Theorem  2.3.1  that  when  iQC  S)  = a is 
constant.  dp-aV  S''-[«U)/tr<S)lY)X  dominates  dp-aYS'Hi  whenever  0 < a < (K-2)  and  0 
< «U)  < 2(K-a-2).  For  the  particular  choice  of  a = K-p-1,  0 < t(U)  < 2(p-l).  In  Dey, 
Ghosh  and  Srinivasan  (1985),  the  dominance  holds  for  all  0 < t(U)  < 2(p2+p+2),  but  only 
under  the  squared  error  loss  and  Y = 1 p. 

Example  2.3.2  If  t(U)  = b is  constant,  and  a = K-p-I,  the  first  inequality  of  (2.3.2) 
becomes  an  equality.  From  the  penultimate  step  of  (2.3.2),  it  follows  that  the  optimal 
choice  of  bisp-l. 


Remark  2.3. 1 It  may  be  noted  that  the  choice  of 

U = I ffl1*  = fix}*,  tr (S)  = ix/p  or  p/ftrffi-'))  = p(£\;')-'  (2.3.3) 

i-t 1 i-t  >1  ‘ 

(the  geometric,  arithmetic  and  harmonic  mean  of  the  eigen-values  of  S)  all  satisfy  condition 
(Oof  Theorem  2.3.1. 


These  estimators  are  X and  % = dp-  YS''l<j)(Y,  S))X  (j  = 1, 2, 3, 4. 


I(t)(YS)  = (K-p-1  (Ip+Kp- 1 )/u(S)]S ; 
I<2)(Y  S)  = (K-p- 1 )Ip+((p-  l)/tr(S2)]S2; 


(2.4.2) 


1(3)0.  a = (K-p-lHp!  (2.4.3) 

1(4)0.  a = (l-°o){(K-p-l)lp+[(p- 1 )/tr(£)15 1 +«o  { I(Kp-2)/tr(S)lS  I . (2.4.4) 


where  Op  = (K-p-l)/IK2(p-l)]. 

Through!,  we  have  considered  the  loss 

LgCfi,  a)=  I (fl,-ai)TQ(2riU)/'rtQ  Y).  (2-4.5) 

The  (K.  p)  combinations  chosen  are  (4,  2),  (8,  2),  (12,  2),  (5.  3),  (9.  3)  and 
(13. 3).  For  each  p,  1000  samples  are  generated  from  Np(9i.  Y). 

For  p = 2.  we  have  taken  ft  = Q or  ft  = (10, 35)T  for  each  i = 1 K.  For  each  p 

= 3,  our  choice  is  ft  = fl  or  ft  = (6, 10, 50)T  for  each  i = 1, ....  K.  Corresponding  to  a 
given  situation,  for  each  (K.  2)  pair,  there  are  four  (Q,  Y)  combinations  with  each  Q or  Y 
equal  to  or  M2  = 4iag(0.0 1 . 1 .00).  For  each  (K.  3)  pair,  there  are  once  again  four  (Q,  Y) 
combinations  with  eachQorYequal  10 13  or  M3  = diag(0.01, 1.00, 100.00).  Thus,  in  this 
situation,  for  each  (K,  p,  Q.  Y,  fit),  the  simulated  risk  is  the  average  of  1000  expected 
losses.  Tables  2. 1 and  2.2  report  the  simulated  risks  for  ft  = Q (px  1 ) while  Tables  2.3  and 
2.4  report  the  same  when  ft  = (10, 35)T  and  ft  = (6, 10, 50)T,  respectively. 

An  examination  of  Tables  2. 1-2.4  reveals  that  like  the  usual  James-Stein  estimator, 

point  towards  which  we  want  to  shrink  (in  this  case  Q (pxK)).  As  the  true  ft  gets  further 
impressive  even  when  the  ft's  are  far  away  from  zero's  except  when  Y = Ij  “"d  Q = Mj 

orY=^andQ  = M3. 

Another  point  to  note  is  that  for  fixed  p,  Q,  and  Y,  the  risk  improvement  of  each 
estimator  increases  over  K.  The  improvement  from  K = p+2  to  K = p+6  is  much  more 


i K = p+6  to  K = p+10. . 


Also,  it  may  be  noted  that  8^,  Bg) 
and  g(4)  all  ouipcrfomi  Bq)  in  terms  of  risk  improvement  when  the  true  Bj  values  are  zeros, 
although  for  Si's  far  away  from  zeros,  all  the  four  estimators  are  nearly  at  par.  Finally,  we 
observe  that  8^  has  a consistent  edge  over  the  remaining  estimators  when  Si's  are  zeros, 
while  S^4j  performs  as  well  as  the  others  when  Si’s  are  far  away  from  zeros.  Thus,  we 
recommend  the  use  of  S,4)  in  the  present  context 
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CHAPTER  3 

EMPIRICAL  BAYES  MINIMAX  ESTIMATORS  OF  MATRIX  NORMAL  MEANS 
FOR  ARBITRARY  QUADRATIC  LOSS  AND  UNKNOWN  COVARIANCE  MATRIX 
IN  THE  ONE-SAMPLE  PROBLEM 


3.1  Introduction 

when  the  covariance  Yof  Xi's  is  known.  In  a more  realistic  situation,  V will  not  be 
unknown,  but  we  have  supplementary  data  to  estimate  it.  This  is  the  situation  in  most 

(2.1.1)  and  (2.1.3),  but  this  time  Y is  unknown  and  an  estimator  Y.  a symmetric  matrix 
independent  of  X,  is  available  as  an  estimate  of  V.  It  is  desired  to  estimate  9 under  the  loss 

Lg-(a.a)=  I(9rai)TQ-(firai).  (3.1.1) 

where  Q*  = y-'«Qy-|/2,  y = y'%l/2,  and  Q is  p.d.  and  known.  It  can  be  shown  that  the 
usual  estimator  X of  9 has  constant  risk  Ktr(Q)  for  all  9 under  the  loss  (3. 1 . 1 ) instead  of 
the  value  Ktr(YQ)  under  the  loss  (2.1.5).  In  this  way  X is  a constant  risk  minimax 
estimator  of  9,  otherwise  under  the  loss  (2.1.5)  every  estimator  of  9 is  minimax  having  the 

Now  the  general  class  of  EB  estimators  of  9 is  modified  as 

YS''l(Y.S))X,  (3.1.2) 

where  KY.S)  is  a symmetric  pxp  matrix.  Under  the  loss  (3.1.1),  the  following  theorem 
shows  the  relationship  between  the  risks  of  X and  a general  EB  estimator  9^(5,  X)  of  the 
form  (3.1.2).  In  this  chapter,  we  denote  by  Eg  v.  the  expectation  conditional  on  9 and  V, 
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E the  expectation  over  the  marginal  distribution  of  2,  and  E the  expectation  over  the  joint 

distribution  ofX.fi  and  Y- Also,  let  Rg*<a,  £<2.  X))  = Efil,ILg*(fi.£<2.X))].  where 

Lq"  is  defined  in  (3.1.1). 

Theorem  3.1.1  Under  the  Lq"  loss.  SEB  defined  in  (3.1.2)  has  risk 
Rg-lfi.!?3) 

= Rg-ia,  X)  + Eflitr(S'1I(Y,  S)Sl(2.  S)S'‘2  Q'Y-tcStSifY.  S)S-ll2  ffi! 
-2(K-p-l)KY.S)a'lYQ'y).  (3-1-3) 

Proof  of  Theorem  3.1.1  This  proof  is  similar  to  the  one  in  Theorem  2.2.1.  LctflP(Y,  20 

-(V2S-'K2.S))Xi,i  = l K.  Then, 

ELg-ffl,?8) 

- E(|i(firfiP)TQ-(Si-fiP») 

= ELg-(fi,  fi8)  + El  £ <fif^P)TQ'(af-eP) ) 

= ELg-(fi,aB)-hEtr((YS'lK2.s)-Yrl)S(i(2.s)a'l2-2r'iOQ-).  (3.1.4) 
where  fi?  is  defined  in  (2.1.6). 

Etr(  (2s'‘k2,  a)-Yzr')sa(2,  s)s-‘2-r‘Y)Q' ) 

= Etr(s‘'i(2.s)ai(2,a)s-1  2q-2-2£-'sky,s)S'i2q,y 


^■SS'YQ’V)) 


= Etr{S-'i(Y,  S)Sl<2.  S)S‘2  Q-y  -2£-‘Si(Y,  S)£'12q"Y) 

t-Ktr(£‘lYQ'Y),  (3.1.5) 

since  tr(2s'1i(2.  S)££''YQ')  = trft-'KY.  S)S.£'IY  Q’Y)  = trffi-'KY,  S)S.r'Y 
ff2>T = trfvg-vs-'sKY,  s)s->)  = BQts-'aaS.  aa-'So-)  = bqi-'&icy.  £)£-' 
YQ’Y)  and  E(£)  = KS. 

Substituting  Q = Q'  into  (12.3)  and  using  (3.1.5),  one  has 
ELg-ffl.!?8)  - KtrtY  Q*)+Etr{  S_li(Y,  £)£l(Y,  S)S''Y  3*2 

-2jr1s3x2.a£‘‘2crYl.  (3.1.6) 

Note  that  Rg-(6, 20  = %yLg-(a,  20  = KtrfYQ")  = Ktr(Q)  for  all  6 and  Y.  Also, 
applying  the  Wishart  identity  as  given  in  Lemma  1.3.3,  one  obtains 
ELg'&S®8) 

- ERg-(fi,a6B) 

= ERg-ia  20+EEaiLir(s-|i(2.  S)Si(2.as'12o‘2 

-4B?ki(2.  s)a-‘)2  q"y-  2(K-p-i)g(Y,  s)s-,2  q*y).  (3.1.7) 

Note  that  Y is  independent  of  K (and  £)•  Again,  the  conditional  distribution  of  £ - 
X XjXj  given  a depends  on  & only  through  X 2j  fljT  = S6T  = S(say)  and  2 is  ancillaty. 

Also  the  family  of  distributions  of  S being  Wishan  lV(p.  K,  A)  is  complete  for  A (or  £). 
Thus,  (3.1.3)  follows  immediately  from  (3.1.7). 


(i)  Y = vQ,  Q known  symmetric  p.d.,  and  v > 0 unknc 


i-y2,v-v(nt2)^,  (3.1.8) 

where  y_n  is  chi-square  distribution  with  n degrees  of  freedom; 

(ii)  Y = Yd  = tiiag(v, Vp).  a diagonal  matrix  with  ith  diagonal  element  vj  > 0 

Yd  = diag(V| Vp).  Vj  - Vjfo+2)’‘xJ.  independenly;  (3.1.9) 

(Si)  Yp-d.  and  entirely  unknown.  With 

Y-(n+p+l)-%(p.n.Y).n>p+l.  (3.1.10) 

The  above  three  cases  had  been  considered  in  Efron  and  Morris  (1972b).  They  showed  that 
the  particular  EB  estimator  (Ip-(K-p-l)V£'')X  dominated  X under  the  loss  given  in 
(3.1.1)  with  Q-  Ip. 

Results  similar  to  the  ones  in  Chapter  2 are  shown  to  hold  (Section  3 .2-3.4)  for  the 
cases  (3.1.8M3.1.10).  In  Section  3.2-3.4,  we  first  provide  two  classes  of  EB  minimax 
estimators,  namely  and  9EB<2),  of  the  fonn  (3.1.2)  under  the  Lg-  loss,  where 

fl™,)(Y.X)-(lp-r<Y.S)YS'l)X.  (3.1.11) 

where  r(Y,  S)  (>  0)  is  a scalar,  and 


X)  = Qp-Y  S'sa(S))X. 


(3.1.12) 
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where  g>(£)  is  a symmetric  p.d.  matrix. 

fiPO  with  certain  r(  V,  £)  under  the  Lq-  loss,  where 

PM,  x>  - dp-r<5.  s)2s'1-2rlK5))x,  0.1.13) 

and  f(S)  is  symmetric  as  defined  in  Section  2.3.  The  risk  difference  of?8®  and  fi8801  is 
given  in  the  following  theorem. 

Theorem  3.1.2  Under  the  Lq-  loss  with  f(S)  = E21(D  ET.  31(1)  is  a diagonal  matrix  with 
ith  diagonal  element  equal  to  'Pj(D  > 0.  and  assume  r(2,  S)  is  differentiable  with  respect 
tosjj  for  each  i.  j=  1,  ...,p. 

Rq*(fl.  ?8(3)(2.  X»-Rq-(tt.  PM,  X)) 

= H2Ytr(ECE2a)A',+2r(2.S)3;(i)A'IlET2Q‘2 

-E14I* 1 )(i)+2(K-p- 1 )!E(DA' 1 1Et2  Q' V ) . <3.1.14) 

Proof  of  Theorem  3.1.2  In  view  of  0-1-3),  one  gets 

Rg-(fl,  ?W3)(2,  X))-Rq-<a,  PM,  20) 

= Ejytrl  [S‘,f(S)5X(S)S'1+r(y.  5)[S’,£(S)+fiS)S’1l]2  Q*2 

-[4EsI5X(S)S-1]*2(K-p-1)£(5)S-|]2Q'Y).  0-1-15) 

Hence  (3.1.14)  is  obtained  by  using  £(S)  = E3KDET  and  (2.2.18),  and  die  proof  is 
complete 
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Finally,  in  Section  3.5,  five  different  estimators  of  fi  are  compared  according  to 
their  simulated  risks  under  the  loss  (3.1,1)  and  under  three  different  assumptions  of  Y 
given  in  (3.1.8H3.1.10).  We  continue  the  choice  of  p,  K.  2,  V and  Q used  in  Section  2.4. 
As  before,  the  estimator  2o„<2.  XJ  = (lv-2[(l-<»o)((K-p-l)S'1+((p-lVti(S))Ipl 
+a0l(Kp-2)/tr(S)]Ip])X  with  suitable  choice  of  o0  has  a definite  edge  over  its  rival 

X,  there  is  no  clear-cut  advantage  of  any  particular  estimator  over  the  others.  Thus,  on  an 
overall  basis,  we  recommend  the  use  of  §oq(2.  20  as  an  estimator  of  2- 


□*  loss  and  ¥ = v Q,  Only  v Unknown 


V = vQ,  where  fl is  known  symmetric  p.d..  Also  v - v(n+2)''x„  is  available.  In  the 
balanced  one-way  MANOVA  model  introduced  in  (2.1.2),  suppose  Yo  = v0  (2  known 
symmetric  p.d.).  n = Kp(m-1),  and  V = m'*vQ.  The  best  multiple  estimator  of  v is  given 


v -n(S  IOirXi)Q'1a£irXi)T)/(n+2).  (3.2.1) 

Note  that  conditional  on  2,  Xi's  and  v (or  V)  arc  independent,  and  the  distribution  of  v 
does  not  depend  on  2-  Also  marginally,  (£,  Y)  is  the  complete  sufficient  statistic  fi>r 
(£,  Y).  where  Y = m'*Yo- 

In  this  case,  the  estimators  2EB<1)  and  §PC)  defined  in  (3.1.11)  and  (3.1.12)  are 
modified  as 


^■’(vq.  x) = Op-r(Ja  s)vas-')x 


(3.2.2) 


20 = Op-vQa-'aa))^ 

First  the  risk  of  ?®®(v  G,  20  is  calculated  as  follows. 


(3J2.3) 


Corollary  3.11  Under  the  Lg-  loss  defined  in  (3.1.1).  flEB(1,(v  & 20  has  risk 
Rg-Oi,?8™) 

= Rg’Cfl.  20  + Ejytrd^Cvfl,  S)-2(K-p-l)r(y  £L  S)]vS'1'2G1'JQflW£'l/2 
-4(Dsr(v£i,JDFawQGl/2 

t^IvVa  S)[dr(v£L  S)/3v  lS1/2fi,/2QC1/2S.'W-  (3.2.4) 

Proof  Of  Corollary  3.2.1  Let  I(i.  S)  = *2.  Sip  and  2 = ?fi-  H follows  from  (3.1.3)  that 

Rg-ffl.jpH')) 

= Rg’O.X)  + Hgvo-{[r2(vQ.S)v2S'l-4[D^r<v2.  S)l" 

-2(K-p- 1 )r(v  Q,  SJvvS' 1 ]Q  Q'  Q)  • (3.2.5) 

Next,  applying  the  formula  for  the  chi-square  distribution  in  Lemma  1.3.2. 

E,[*vas)vV'i 

- Ev((jS^a^+(^)v(r^vQ.5)+v(ai2(v£LSVW)l) 

- Hyi^asiv-K^jvVaatarfvasvav)). 

where  v - and  r is  such  that  all  expectations  in  (3.2.6)  exist. 


(3.2.6) 


Combining  (3.2  J)  and  (3.2.6),  i 


Rq'(8,  §EB,I>) 

- Rq-(8. X)  + Eflvir([i2(vQ,a))-2(K-p-l)r(?aS)lvvS-|fiQ‘a 

-4aA(5as))^Q's+(^2v'(va.s)(3r(vasy3v)5’l2Q‘a(3.i7) 
Recall  dial  Q’  =ylflQyIB  = v1C,/JQQ’W.  one  has  (3.2.4)  from  (3.2.7).  The  pn»f 

The  following  theorem  is  an  immediate  result  of  Corollary  3.2.1. 

Theorem  3.2.1  ifB(l,(v  Q.  20  of  the  form  (3.2.2)  dominates  K under  the  Lq"  loss  if 
0)  0<r(?fi,S)<2(K-p-l); 

ffl)  r(v  G,  S)  is  non-increasing  in  v ; 

(in)  E^r(vQ,S)l  is  n.n.d.. 

ProoLef  Jhegtnn  32,1  Since  S-Mqi/2QGi,2S:1'1  and  GWQG1'2  are  symmetric  p.d„ 
and  v > 0 with  probability  1 . It  is  easy  to  see  that  Rg-(8,  20  < 0 under  the 

assumptions  (i)-(iii).  This  concludes  the  proof. 

As  in  Corollary  2.2.2,  we  consider  the  special  case  rfv  o.,  s.)  = r(F)  where  F = 
tr(£.y')  = v-1  trtSQ'1)  in  the  next  corollary. 

: F = v-1  tr(££± ').  and  r(F)  is  differentiable, 
dominates  2J  if 


Corollary  3.2.2  Let  rivO.  SI  = rtFl  where 
Then  under  the  loss  (3.1.1),  S'3*0’!?  0,20 1 


0)  0<rtvO,S)<2(K-p-i); 


(ii)  r(F)  is  non-decreasing  in  F. 

Proof  Of  Corollary  3.2.2  Since  WSG"1)  > 0,  il  is  obvious  lhai  condition  (ii)  in  Theorem 
3.2.1  is  satisfied.  Also.  D^r(F)  = r'(vQ)‘1  2 Q by  using  condition  (ii)  and  Q is  p.d..  We 
complete  the  proof. 

K-p-1  which  leads  to  the  matrix  extension  foim  of  James-Stein  estimator  (^-(K-p-lJv  Q. 
£'l)X.  Obviously,  all  the  conditions  of  Theorem  3.2.1  and  Corollary  3.2.2  are  met. 

Remark  3.2.2  From  the  proof  of  Theorem  3.2.1.  it  follows  lhat8EB‘l*(vQ,X)  dominates 
X under  the  conditions  (iMiii)  of  Theorem  3.2.1  and  Lg  loss  given  in  (2.1,5). 

Before  we  show  in  Theorem  3.2.2  that  some  S^fvQ,  X)  of  the  form  (3.2.3) 
also  dominate  X under  the  Lg"  loss,  the  risk  of  fp^fvlj.  X)  is  derived  here. 

Corollary  3.2.3  The  risk  of  BB  estimator  X)  of  the  form  (3.2.3)  can  be 


Rg-(fi.2EB(2)) 

= Rg*<a,  x> + Eg.trf^v  Ql/2al/2LS-,t(5)Sai(S)S‘1-4i2s[Sa<S)S-1] 
-2(K-p-i)aKS)S'1]aI/2Qlfl). 


(3.2.8) 
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RQ.(a.eEB®) 

= Rg-(fi.  X)  + Eavir{[s-‘ffi(S)aa(a)S'l52-4Qsisa(S)a’1i?v 

-2(K-p-l)ffi(S)S'l?v]aQ'Q).  (3.2.9) 

Since  5 is  independent  of  X and  S,  and  v ~ vfn+2)-‘  ^ . one  has 

= R<j-(fi,X)  + Eflvir([a-|a(S)£iE(S)a’,-4l2SlSS!(S)5'1] 

-2(K-p-l)a(S)S',]S72V2CQ’Q).  (3.2.10) 

The  final  expression  (3.2.8)  follows  from  (3.2.10)  by  the  fact  Q‘  - yWgy'f2  . 
v-Iq-1/2qq-1/2  jiu  proof  is  complete. 

We  continue  to  assume  a(S)  = £!E(i)  ET  as  in  Section  2.2.  Then  from  (2.2.16)- 
(2.2.20),  we  have  the  following  theorem. 

Theorem  3.2.2  Suppose 

(i)  0 < 4'id)  < 2(K-p-l)  for  each  i - 1 p; 

(ii)  d'ifQJ/JXj  a 0 for  each  i = 1 p; 

(iii)  ^(D'S  are  similarly  ordered  with  the  Vs,  that  is, 

(>Ki(D-yj(i))(Xj-XJ)  a 0 for  every  1 S i.  j S p. 

Then  fi88®  of  the  form  (3.2.3)  dominates  X under  the  Lq-  loss. 


; See  (2.2.16)-(2.2.20)  and  the  proof  of  The 


i 2.2.3. 
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Remark  3.2.3  The  mi's  proposed  in  Example  2.2.2-2.2.6  and  Remark  2.2.3  validale 
fPm(va.  X)  = (If,-vGS'lS(S))X  as  a betier  esiimaior  than  X under  the  Lg-  loss. 

Remark  3.2.4  In  Theorem  3.2.2,  it  can  be  shown  that  the  dominance  of  jP(2>  over  X 
continues  to  hold  under  the  same  conditions  and  the  Lg  loss. 

Although  we  have  shown  estimators  aEB°l  of  the  form  (3.2.2)  to  be  better  than  X 

form  of  the  estimator  jpG)  of  g defined  in  (3.1.13) 

iP^tvO.  X)  = (Ip-r(vG.  5)vGS'l-vQa'1£(S))X,  (3J.11) 

(P^G.  X)  = Op-KvG,  S)VGS'')X,  (3.2.12) 

where  r(vQ,  g)  is  differentiable  with  respect  to  Sjj  for  each  i,  j = 1, ...,  p,  and  r(v  (j_.  g)  is 
non-increasing  in  v . Follow  the  assumption  V = vQ.  with  v - v(n+2)‘ and  Lemma 

(1.3 J),  one  has  Ev(52t(vG.S)I  - vEvtv  r(vG,S))+2v(n+2)-|E„[^3r(va.  SJflv). 
Hence  (3.1.14)  yields 

Rg.(g,  fP°>(vG,  X))-Rg'(fi,  iP(,,(vG,  X)) 

- Eflytr|ECE2GJA‘l+2r(vG.S)!E(i)A'llETv2QQ’G 
-EI43^l,(i)+2(K-p- 1 )!C(&)A'l]ETv  v Q Q"GX 
= Efi  vtrlvQ1'3Gl/2EO(2(i)Al-2(K-p-l.r(CG,  S))±(i)A'' 


s)/35  )1EtQwQ,/s  I 

S Eg  vir  (JQ'^G^m;2^1  -2(K-P-  l-r(?  Q,  S)mi)A' 1 

-4±0,(i)]ETCl/2Q1/3)<0.  (3.2.13) 

for  all  a and  v if  AU(D  < 0 for  each  i = 1, ....  p.  where 

A,i<i)  = i’fa)x;,-2(K-p-i-r(?Q, sOTiO)*.;1^  x vr-,<&>-'rjQi)vo*-ty 

-404'iQJ/a^i).  (3.2.14) 

Hence  we  have  a generalization  of  Theorem  2.3.1. 

Theorem  3.2.3  Lei  0 < r(v  S,  S)  < (K-2),  r(5Q,  S)  is  differentiable  with  respect  to  Sy  for 
each  i,  j = 1 p.  Suppose  %&)  = Xjt(U)/(  |>u),  i = 1 p,  where  U is  a function  of 

H 

the  Vs.  Suppose  that 

(i)  r(v  Q,  S)  is  non-increasing  in  v; 

(ii)  U(D  is  non-decreasing  in  Lj  for  each  i = 1,  ..„  p; 
fui)  t(U)  is  non-decreasing  in  U; 

(iv)  0<t(U)<2(K-r(vS.S)-2). 

Then  sFB(3>(?G.  20  dominates  §“">(?  Q,  20  under  the  Lq-  loss. 

Proof  of  Theorem  3.2.3  Same  as  the  proof  of  Theorem  2.3.1  with  the  obvious  change  of 
r(V.S)intor(vQ,S). 


Examples  2.3.1  and  2.3.2  and  Re 


die  following 


Remark  3.2.5  When  d»Q,i)  = a is  constant.  0p-a?QS‘1-I'(U)/tr(S)lvC)2i  dominates 
(Ip-avas'bx  whenever  0 < a < (K-2)  and  0 < t(U)  < 2(K-a-2).  For  die  particular  choice 
of  a = K-p-1. 0 < i(U)  < 2(p-l).  If  KU)  = b is  constant,  and  a = K-p-1,  it  follows  that  the 
optimal  choice  of  b is  p-1 . Note  that  when  U equals  the  geometric,  arithmetic  or  harmonic 
mean  of  the  eigen-values  of  S.  then  the  condition  (ii)  of  Theorem  3.2.3  holds. 

Remark  3.2.6  In  Theorem  3.2.3.  iP^lvfi,  X)  also  dominates  X)  under  die 

I^jlOSS. 


q-  loss  and  Y = Yu  an  Unknown  Pwgonal  Matrix 


equal  to  Vj.  Assume  the  estimator  Vj  = diag(V| , ....  vp)  is  available,  where  Vj  is 
independently  distributed  as  Vj(n+2)‘'xJ.  Also  Xi  and  Vj  are  independent.  For  example, 

suppose  Yo  = - diag(v, vp),  an  unknown  diagonal  matrix  with  ith  diagonal  element 

Vj  > 0.  in  the  balanced  one-way  MANOVA  model  given  in  (2.1.2).  Let  n = K(m-1),  V = 
= diag(V|,  ....  vp),  where  v(  is  [m(n+2)]-1  times  the  ith  diagonal  element  of 
( £ £ (Xij-XiKXg-Xi)T).  Then  v(  - Vj(m(n+2)]'1j;^  independendy.  Also  conditional  on  fl, 
Xj’s  and  V are  independent,  and  the  distribution  of  V docs  not  depend  on  Q.  Marginally, 
(S,  V)  is  the  complete  sufficient  statistic  for  (£,  Y).  It  is  desired  to  estimate  6 under  the 


in  their  risks.  Lei 
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20  = (Ip-rtYd.  S)Ydfi'‘)X. 


(3.3.1) 


where  r(Yd.  5)  (>  0).  and 


S^CYd.  2D  = (Ip-YdS-'aKS))^ 


(3.3.2) 


where  ffl(S)  is  a symmetric  p.d.  matrix. 


The  following  corollary  provides  the  risk  for  flP^Yd.  2D  under  the  loss  (3.1.1) 
withQ-^QY/2. 

Corollary  3.3.1  Yd.  2D  as  defined  in  (3.3. 1)  has  risk 


= Rg-(a.2D  + Eaidtr|(^)r2(Yd.S)S'1YdQ‘Yd 
-K^j)  t^lYd,  S)&*lYddiag(Q")Yd*2(K-p-l)r(Yd,  S)S°  YdGfYd 
•H^MYd.fflS-'YdlYrtYd.  SJlYdCYd-KD^Yd.  SllYdQ'Yd).  (3.3.3) 

where  diag(Q*)  = diag(qj , q^),  Q'  = «q".)),  and  £r(Yd.  S)  is  a diagonal  matrix  with 

ilh  diagonal  element  equal  to  3r(Yd.  S)/9v,. 


I Y = Yd-  It  follows 


Rq-O,  &EB(i)) 


= Rg-lfi,  X)  + Ej^trfr^Y*  ^-|YdffYd^!BS'<Sd.S>]2dQ'}!d 

-2(K-p-i)r<2<i.s)S‘'yds-yd). 


(3.3.4 


LaE^Yd,  S)iQl2dQ‘2dl  = E^aa  = ^(c^)],  where  Wjj  = ASd.fflvftqJvj- 
Since  vj  and  Jfc's  are  independent  and  v,  ~ vj(n+2)''  x„.  “sing  Lemma  1.3.2,  one  gets 


- Ev.[rz(Yd.S)vTlv?q‘| 

= EVi((i^2d.avi+(^2)v?i<2d.S)(at(2d.S)flvi)lqi'i)  (3.3.5) 


Ey.IWjjl 

- Ev.lr2(2d.5)v:,viqi'vjl 

= Ev.|  K^j)  S)-K^j)  ?if(2d.  SX^Yd.  S)«?i)]q'j?j ) • (3.3.6) 


Combining  (3.33)  and  (3.3.6)  to  get 

EflVd“1r2(2d.S)5'l2d9’2dl 

‘ E9Ydtrl^)  **&*•  SJS-'Yd  Q‘Yd*<^)  r^Yd.  S)S' % ^(Q'lYd 

+<^)  rtYd.  S)S‘Ydl2rt2d.  S)]2dQ’idl 
* ^Yd^O  *2d,  S)S‘2dQ’Yd-H^)  ^(Yd.  5)S',2ddiag(Q’)Yd 


k^)  <<2d.  m'VdtSrtid.  snXdflfSU 


(3.3.7) 


Substituting  (3.3.7)  into  (3.3.4)  gives  (3.3.3).  Corollary  3.3.1  is  proved. 

Assume  the  loss  function  (3.1.1)  with  Q = &,  = diag(q, ! qpp).  Recall  that  Q = 

((qy)).  The  dominance  conditions  of  over  3(  arc  provided  in  the  next  theorem. 

Theorem  3,3.1  Suppose  Q = Qd  = diag(qu qpp)  in  (3.1.1)  and 

CO  0 < t(S(|.  fi)  < 2(K-p-l); 

6i)  rt2d.  S)  is  non-increasing  in  5j  for  each  i =1 p; 

(iii)  rtSd.  S)  is  non-decreasing  in  s„  for  each  i =1 p. 

where  S = ((Sjj)).  Then  Rg-(fi.  f8*'*)  < Rg-(JL  3D- 

Proof  of  Theorem  3.3.1  From  (3.3.3),  and  using  the  assumption  that  Q = diag(qn,  .... 
qpp)  is  a diagonal  matrix. 

Rg.(S,  if*") 

= R<2'(6. 30  + Eg^trl  [t^lYd,  S)-2(K-p- 1 )rt2d.  s)]£'lydsryd 
+^2tcyd.s)s-1ydivr(2d.  fiiiydCyd-^yd.  aiydO’yd) 

S Rg>(fl.30-<-Eaydtr([r2(y(,,S).2(K-p-l)r(2d.S))S'l2dQ’yd).  (3.3.8) 

The  inequality  in  (3.3.8)  uses  diagonaiity  of  y*  Vj  and  Q’,  and  conditions  (ii)  and  (iii). 
The  theorem  now  follows  from  (3.3.8)  and  condition  (i). 

Consider  the  particular  choice  of  r(2|,  S)  = r(F),  where  F = rtSyj1). 


Corollary  3.3.2  Lei  «&,  £)  = r(F).  where F = tr(S2jV  Then  SP®{Si|.  a = dp-t(P)2l 
a_,)2£  dominates  X under  the  Lg-  loss  with  Q = Qj  a diagonal  matrix  if 

(i)  0<i<F)<2(K-p-l); 

(ii)  r(F)  is  non-decreasing  in  F. 

Proof  of  Corollary  3.3.2  Since  dr(F)/35i  = r'faF/K,)  S 0 and  dF/3su  = I'OF/tau)  2 0. 
conditions  (ii)  and  (iii)  of  Theorem  3.3. 1 are  satisfied.  The  corollaty  follows  now  by  using 

Remark  3.3.1  The  optimal  choice  of  rtSj.S}  = a (constant)  in  Theorem  3.3.1  and 
Corollary  3.3 2 is  K-p- 1 which  leads  to  another  matrix  extension  of  James-Stcin  estimator 
ap-(K-p-l)Vd£-')X. 

Remark  3.3,2  By  examining  the  proof  of  Theorem  3.3.1,  under  the  same  conditions  and 
Lg  loss  with  Q — Qj,  the  risk  of  (P®  is  smaller  than  that  of  X 

The  risk  Rg-<a,  fP®)  of  jP®  defined  in  (3.3.2)  is  provided  here. 

CgrolliffY  3,3,3  Under  the  Lg*  loss,  jP®(2d.  X)  = (Ip-2tiS''ffi(S))X  has  risk 
Rg-(fl.iP®) 

- JO  + -R-j  Efl yjtrl  [n£' ‘sjKSJS  !C(£)S’ 1 -4(n+2)E^I£ JB(S)S’ 1 1 

-2(n+2)(K-p- 1 )a(£)£' 1 ] Y^/2Qi^/J+2£'  ‘ffl(S)Sa(S)S'  ) • (3.3.9) 


so 

Proof  of  Corollary  3.3.3  Replacing  1(2.  S)  = !E(S)  and  2 = 2d  in  (3.1.3).  one  gets  ihe 
expression 

Rg.(fl,flPra> 

= Rg-lfi.  X)  *Efli<1ir(S-|!|!(S)Sa(S)S‘'2(iQ‘2li-‘IisiS»(S)S''l2dQ‘Yd 

-2(K-P-i)a(5)S12iryd)-  (3.3.io) 

Note  that 

ESjl2dff5d)  = Hi!d[((viq‘jvj))] 

= ffJItrK^]  & dke<2")^d  (3.3.11) 

E^dSrYdM'^lidQ’Vd  (3.3.12) 

By  using  (3.3.1 1)  and  (3.3.12),  (3.3.10)  can  be  expressed  as 

Rg-O.?8®) 

= Rg-(fi.  X)  + ^aEflydtr{  [nSl!t(S)Sa(S)S’1-4(n+2)DS(Sffi(S)S’'] 
-2(n+2)(K-p- 1 )a(5)S' 1 ]Yd  Q* yd+2S‘1ffi(S)S  Sl(S)S'l¥d  diaglQ’JV,, ) . (3.3.13) 
Next.  (3.3.9)  follows  (3.3.13)  by  the  fact  Q*  = and  diag(Q')  = j£/2QJ  JQW. 

Nonce  that  Q„  = diag(q, , qpp)  £ (max  q^  £ (max  qu)/(min  i,i)Q.  where 

(max  qji)  £ qii  and  (min  Itf)  £l)j  for  each  i = 1 p.  and  tlj's  are  eigen-values  of  Q.  Also 
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&’1#CS)&B<S)&‘1  « symmetric  p.d..  one  has  trf  S’ ‘ssfSjSsCSia'  %nQXf\  S trJS.- 1 
ffi(i)SaKS)a'lvj/,(max  qaVfmin  TU)Qi^/2J. 

It  follows  from  (3.3.9)  that 

Rg-lft.?®®) 

s Rg’(fl,  X)  [(n+2(max  qu)/(min  tli))S’'sKS)SlS(S)S’1 

-4(n+2)Ds(Ss|!(S)S'll-2(n+2KK-p-i)a(S)S‘l]vj/J9Xj/2) 

= Rg-ffl.  X)  + (^j)ro  Ej rS'1ffi(S)Sffl(S)S'1-4r0Bs[Sffi(S)S’1] 

-2r0(K-p-l)3B<SJS-,]i^/2Qi^/2} . (3.3.14) 

where  r0  = (n+2)/tn+2(max  qu)/(min  *ij)J- 

The  inequality  in  (3.3.14)  becomes  an  equality  if  Q is  a diagonal  matrix  which 

a - EA.ET  and  »(£)  = E!E(i)  ET  as  before.  Using  the  same  argument 
described  in  Section  2.2.  we  prove  the  next  theorem  which  generalizes  Zheng’s  (1986) 

Theorem  3.3,2  fl88®  given  in  (3.32)  has  risk  smaller  than  X under  the  Lq-  loss  if 

(i)  0 < fiO)  < 2r0(K-p-l)  for  each  i = 1 p; 

(H)  34‘.(D/32ia0forcachi  = l p; 

(hi)  4'.(D's  are  similarly  ordered  with  the  kj’s.  that  is. 

W’lCU-’f'.IDMXi-Xj)  a 0 for  every  1 Si.jSp. 


Proof  of  Theorem  3.3.2  It! 


Ujd)  = >P?(D>-:l-4ro,^l>(i)-2ro(K-p-l)4'i(i)X;'<  0 (3.3.15) 

for  each  i = 1 p,  where  is  defined  in  (2.2.19)  and  is  positive  under  die 

as  mp  on  ( ) and  (iii).  Hence  Ui(X)  £ X ‘'Pi(i)PP1(i)-2r0(K-p-l)]  < 0.  by  using 
assumption  (i).  The  proof  of  the  theorem  is  complete. 

Remark  3.3.3  The  range  for'Pjd)  in  Theorem  3.3.2  is  smaller  than  die  one  in  Theorem 
2.2.3  which  reflects  the  price  for  estimating  the  unknown  v,'s  from  Vj's.  Let  be  a 

diagonal  matrix  with  ith  diagonal  clement  equal  to  (2.2.22).  Then  if  a 2 0,  b 2 0 and  0 < 
a+b<2ro(K-p-l),  (Ip-SdCaS'1  +<b/tr<S))IpllXand  (Ip-SafaS’1  +<b/tr(S2))Sl  IX  dommatc 
X for  every  Lq-  loss.  For  estimators  not  meeting  the  conditions  of  Theorem  3.3.2,  such  as 
Op-Suit'1  B(S)>2£  with  ¥&)  defined  in  (2.2.24),  and  (2.2.27)  and  (2.2.30)  dominate  X if 
(a)  0 < b < 2r0(K-p-l),  (b)  c > 0,  and  (c)  0 S d £ p(K-p-l)/2.  and  if  (a)  0 < b < 2ro(K-p- 
1),  (b)  c > 0.  and  (c)  0 £ d £ (K-p-l)/2.  respectively.  Also  0p-£i(S'l!B(S)la)X  dominates 
X under  Lq-  loss,  if  a s [0,  c'0'(K-p- 1 )(p- 1 )•  { 1+|  1 +Co(p- 1 )'2(K-p- 1 )'2((K-p- 1 )2 
+<p-l)2)]l/2)].  CO  = [(IMHp'aCp^)]2,  where 

{S',ffi<S))a  = (l-tx)[(K-p-l)S'l+((p.l)/ti(S))Ip)+ct[((Kp-2)Ar(5))lp|  (3.3.16) 
and  the  optimal  value  of  a is  c«(p*  1 )(K-p- 1). 


Remark  3.3.4  It  can  be  shown  that  estimators  of  the  form  6EB<2)  given  in  (3.3.2)  also 
Next,  we  consider  estimators  of  the  form  (3.1.13)  with  2 =&, 


i in  (2.1.5). 


Sp^fY,  JO  = (Ip-rtYd,  SJYdS-'-YdS'toJJi- 


(3.3.17) 


From  Theorem  3.1.2,  it  follows  that 

Rg-ffl,  P^’tYd.  JQ)-Rg*(fi.  F'H.  JO) 

- Ea Ydtr|ECt2(i>A1+2r(Y),  S)m)A‘‘lEIYd SfYd 

-£I4^  1)(2J+2(K-p- 1 )ai(DA' 1 lETYd  STYd  1 . (3.3.18) 

where  r(Yi.  S)  is  differentiable  wiih  respect  to  Sy  for  each  i,  j = 1 p.  and  S3  is 

non-increasing  in  Vj  for  each  i =1 p. 

1.3.2  as  in  (3.3.5M3.3.7),  one  gets 

Eyd(i<Y|.S)YdQ’Yd) 

= S)Yd  STYd+l^lrfYd.  S)Yd  diag(Q‘)£i 

♦(^YelYrtYd.aJYdffYd).  (3.3.19) 

and 

Eyd(r(Yi.S)Yd1Yd) 

- E^l(^)r(&.aip+(^)Ydl2rtYd.a)l)-  (3.3.20) 

Combine  (3.3.19)  and  (3.3.20)  to  get 

EydMYi.£)YdQ'Yd) 

= eYi(02  SlYdQ'Yd-^jYdQ’YdYdlYrfYd.  S)1 


K^j)iWS(2d.S)lSdSrSd). 


(3.3.21) 


Using  (3.3.1 1).  (3.3.12)  and  (3.3.21).  one  can  rewrite  (3.3.18)  as 

Rq-(2.  Sp^ffld,  20>-Rg*(fi.  P(1)(5d.  20) 

= Eaidtr(m;2a)A‘1-*-2r(Sd.  &m)A-|lET[(^)2YdQ‘i'd 

•^2  & diaglQ'lYdl-Hd^1  '(AJ+ZfK-p- 1 )S0)A'11ET(^)  YdQ'iid 
+2EIr(Sd,  SmDA’1  ]ET[^ydQ'i!d2<il2'<Sd.  S)1 
’^SYjdiag<Q-)YdY1,[2riYd. S>w£$W2r<2* S)l2i CTYd))-  (3-3.22) 
Suppose  Q = Qd  then  Q*  = diag(Q")  and 

Rg'fSL  aEB(3|(Yd,  Kt)-Rg-(fl,  Sp"’^.  20) 

£ E9Yd'r|<^)C£2a)A'‘-2[K-p-l-r(Yd,S)l!E(i)A'' 

-4)E<l)(i)lETY^/2QdV|flE|.  (3.3.23) 


To  show  (3.3.23)  < 0 for  all  2 and  Yd,  it  suffices  to  show  A2i&)  < 0 for  each  i = 1 p, 


Ajjttj  = 'K?&)X:I-2[(K-p-i).t(2d, BPPiQ)Xj,-2  I rrayvflmr^ 


-4tWi(lv9Xi]. 


(3.3.24) 


Now  we  prove  §EBG)(Sd,  2D  dominates  2D  under  the  Lg-  loss  with  Q = 

Ol  by  providing  solutions  to  (3.3.24)  < 0 for  each  i = 1, ....  p. 

Theorem  3.3.3  Let  0<  rtij,  S)  < (K-2) , rQJ|,  S)  is  differentiable  with  respect  to  Sy  for 

each  i.j  = 1, ....  p.  Suppose  4'l(i)  = Xjt(U)/(  £xp.  i - 1 p.  where  U is  a function  of 

theXi’s.  Assume  that 

(i)  rtVj.  S is  non-increasing  in  w,  for  each  i =1, ....  pi 

(ii)  Ud)  is  non-decreasing  in  Xj  for  each  i = 1 p; 

(ill)  t(U)  is  non-decreasing  in  U; 

Ov)  0<t(U)<2[(K-2).rffld.  S)l- 

Then  A2iQj  < 0 for  each  i = 1 p.  Hence.  iP^fSd.  2D  dominates  jp0^,  2D 

under  the  Lg-  loss  with  Q = Qj. 

Proof  of  Theorem  3.3.3  For  the  particular  choice  fid)  = XjtftW  f,Xj),  one  obtains  from 
(3.3.24)  that 

Aad)  - X,^(UV(£Xj)2-2((K-p-l)-r(Xi.a)t(UV(iXj)-2(p-l)t(UV(iXj) 

^[t(uv(ixj)-xit(u)/(ixj)2+(Xj/|xj)t'(U)(du/axi)] 

= [t2(UH4tOfl)Xi/(£xj)2-2(K-r(2d,  S))t(UV(£Xj) 
-4(v£xj)T(U)(au/aXi) 


[t2(U)*2[(K*2)-r(^j.  S)Jt(U)]/(  £Xj)  • 


(3.3.25) 


under  die  assumpdons  (i)-(iii).  The  conclusion  of  this  theorem  followed. 

If  we  concentrate  on  the  improvement  over  (Ip-aSuJi ')2£,  that  is,  r(2d,  S)  = a is  a 
constant  function,  then  0fl-a5dS'l-[t(U)/ir(S)iyd)X  dominates  (Ip-a&ii'1)*  under  the 
Lg*  loss  with  any  p.d.  Qis  shown  in  the  following  theorem. 

Theorem  3.3.4  Let  0 < a < r0(K-2)  where  r0  = (n+2)/(n+2(max  qH)/(min  1)l)l  Suppose 

TiQJ  = MU)/(iXj).  i = 1 p.  where  U is  a function  of  the  Vs.  (Ip-aSdi'1- 

>1 

[ttUI/trlSHVjJX  dominates  Qp-aydS‘l)X  under  the  Lg*  loss  with  any  p.d.  Q if 

(i)  U(A)  is  non-decreasing  in  X*  for  each  i = 1,  ...,p; 

(ii)  t(U)  is  non-decreasing  in  U; 

(tii)  0 < t(U)  < 2[r0(K-2)-a]. 

Proof  of  Theorem  3.3.4  In  view  of  (3.3.22), 

Rg-(a.  jF^Od.  X))-Rg-(2.  aEB,,)(X).  X)> 

S Efl^tr|EO;2(i)Al+2a±(i)A-|IETI(^)2 

4-^j  ((maxqliV(minrii))lv|,2Qi^'2 

-Ei4^'>a)+2(K-p-i  ao)A-  'iet(^)  ^nQy^n ) 

= Ea  Yd*'!  (^5iro  ffi2a)A-  '-2[r0(K-p- 1 )-aUE(i)A'' 


-4r02!*,*(D]ETYdQ-Yd£l- 


(3.3.26) 


To  show  (3.3.26)  < 0 for  all  S and  Vj  is  suffices  lo  show  A2'i&)  < 0 for  each  i = 1 p, 

where 

A2'i(D  - ^fOJX;1-2(r0(K-p-l)-a)1'i(mi'-2ro.Z^i(i)-'l'ia)V(Xi-Xj) 

-4r0[3'Piaj/3>iil.  (3.3.27) 

With  the  choice  of  %&)  = Kjt(U)/(  ^Xj),  one  obtains  from  (3.327)  that 

A2'j(D  = XjftUVf  £Xj)2-2[ro(K-p- 1 )-a)t(UV(  £Aj)-2ro(p- 1 )t(U)/(  £xj) 
j-l  H >■> 

-4r0[t(UV(£2j)-Xjt(UV(£2j)J+(Xi/^j)T(U)OU/3Xj)] 

= [^(UJ-MrotfUDVI  £Xj)2-2(r0K-a)t(U)/(  £k) 

j-l  j-l 

-4r0(vi^j)t'(U)(3U/3^) 

S (t2(U)-2[r0(K-2)-a)t(U)]/(  £X;)  < 0,  (3.3.28) 

j-l 

under  the  assumptions  (i)-6ii),  and  the  proof  is  complete. 

town*  3.3.?  For  the  particular  choice  of  a = r0(K-p-I),  (Ip-aYd£'l-(t(U)/ir(S)iyd)X 
dominates  (lp-aSdS'')2(.  under  the  Lg-  loss  with  any  p.d.  Q whenever  0 < t(U)  < 
2ro(p-I).  Furthermore,  if  t(U)  = b is  also  constant,  it  can  be  shown  that  the  optimal  choice 
is  b = r0(p-l).  Notice  that  if  Q is  a diagonal  matrix,  then  Theorem  3.3.4  can  be  slighdy 


Remark  3.3.6  Consider  the  Lq  loss  instead  of  Lq*  loss  in  (3.3.18)-(3.3.22).  The 
conditions  in  Theorem  3.3.3  and  Theorem  3.3.4  still  support  Rgfk  2D)  < 

Rq(&.  2D). 


covariance  matrix  V.  of  2£|'s  is  complete  unknown  and  needs  to  be  estimated  from  the 
supplementary  data.  We  assume  2 - W(p,  n.  (n+p+l)-lV)  is  available  as  in  (3.1.10).  In  a 
balanced  one-way  MANOVA  model  introduced  in  (2.1.2),  suppose  Vo  is  entirely  unknown 
and  n = K(m-l).  Then 

2 = m 'x  I (2£jj-2£i)(2£ij-2ii)T/(n+pt  1 ) 


is  the  best  multiple  estimator  of  Y = m’'Vj).  Note  that  conditional  on  fi,  2£i’s  and  V are 
independent,  and  the  distribution  of  V does  not  depend  on  g.  Also  marginally,  (£,  V)  is 
the  complete  sufficient  statistic  for  (£,  V). 

The  two  cUsses  of  EB  estimators  gEB<l>(2,  2D  and  ?®®(2,2D  defined  in 
(3.1.1 1)  and  (3.1.12)  respectively  are  going  to  be  used  in  this  section.  Estimators  if6®  of 
the  form  (3.1.1 1)  are  direct  extensions  of  Baranchik's  (1970)  type  of  estimators  for  the 
matrix  normal  means  and  complete  unknown  covariance  matrix,  while  cstimatots  g68® 
defined  in  (3.1.12)  generalize  those  in  Efron  and  Morris  (1972b,  1976b),  Stein  (1974), 
Dey,  Ghosh  and  Srinivasan  (1985)  and  Zheng  (1986)  for  complete  unknown  covariance. 

(3.1.1)  and  the  generalized  version  of  the  James-Stein  estimator  namely  Up-a2s‘')X  is 
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Recall  an  EB  estimator  if8  has  risk  Rg‘<&.  fi88)  expressed  in  (3.1.3).  Based  on 
the  foim  of  if8*1*,  one  way  of  expressing  the  risk  Rg-(fi.  if8*1*)  is  given  in  the  following 
corollary. 

Corollary  3.4.1  Under  the  Lg-  loss,  fi88*1*^.  2D  = (Ip-r(2,  S)Y£‘')X  has  risk 

Rg-ffl.?80*) 

= Rg-(fi,X)+n(n-t-[»-|r2E6i.tr|(n+l)r2(2,S)S‘lYQ'Y+r,(2,S)triQ-y)S-'Y 
-2(n+p+l)(K-p-l)r(2,S)£’1YQ"Y)+2(n+p+l)'JEjytr||I2Yi2(2,  S3) 
[(n+p-n)2s'12Q,Y+9ti<Q-y)S'1Yt(n+i)ys‘1YQ-vi 
-2(n+p+l)(K-p-l)[l^i<2.S)]2S-,YQ*Y) 

-4EeYtr||E>ar(Y,a)]2Q,Y).  (3.4.1) 

Proof  of  Corollary  3.4,1  By  expressing  k2.  £>  = r(2.  S)Ip  in  (3.1.3),  one  gets 

Rg-tfi.fi88*1*) 

= Rg"<fi,2D  + Ejytr{r2(2,S)S'l2Qr2-4[D^r(2.S))2Q‘Y 
• 2(K-p-l)r(2,£)S'lYQ'Y).  (3.4.2) 

Recall  Y ~ ^(p.  n,  (n+p+l)'*Y)  is  independent  of  i,  and  define  QY  = ((d¥))  as  in 
Lemma  1.3.3.  Applying  the  Wishart  identity  in  Lemma  (1.3.3),  we  have 

Ey{r(2,S)2) 

= EyKn+p+ir'Y-Kn+p+ir'Y]-1^.  S)2) 


►P+ir'y-EyUDilKS,  SJVMn-p-lMY,  SJIp] . 


(3.4.3) 


By  perfonning  ihe  formal  multiplication  and  differentiating  coordinatcwise  (see  Lemma  1 . 
Haff(1981)).  we  get 

e£my.s)yi 

- a£*Y.  s)]2+[(rtS.  s)yTa^)TiTY 

= [B^rtY,  S)lY+r(  V.  S)U^V],  (3.4.4) 

[■^yi  - (a^Wuxavg/avn))) 

= ((Zja+SmJSij)) 

= ((^*1)6, j)) 

= 5<P+l)Ip-  (3.4.5) 

Combine  (3.4.3H3.4.5)  to  obtain 

Ey(r(Y.S)Y)  »(n+p+l)-*YEirl2n^r(y.S)j5+n  r(2.S)Ip).  (3.4.6) 

Eflytr(r(Y.S)S'lYQ,Y) 


= Eaytr(5-'r(2.  S)Y[(n+p+l)',YD’l((n+p»l)‘lY]Q"V) 
= (n+p+ir'Ejytrt  YQ’YS'VY.  SJYKn+p+lf'Y)'1 ) 


= (n+p+ir'Eflyn-iy  Q'ys'Vy.  S)2[(n+p+i)‘lM'1)T 

= (n+p+i)'*E0ytr|[(n+P+i)'1y]"1i(y,  S)2S''yQ'¥) 

- (n+p+l)-1EaYtr{2[I^t(y,S)]yS’,yQ,Y+nr(2,S)S-|YQ,y).  (3.4.7) 

Next,  we  consider  the  expression 

HflYir(r2(y.S)a‘1yQ,S) 

= Hayir(r2(2,S)YS'15Q') 

= (n+p+ir1ESYir([(n+p+ir,y]-|i2(2,s)ys'l5Q'y).  0.4.8) 

Lemma  1.3.3  is  applied  again  for  V ~W(p,  n,  (n+p-fl)"*Y)  to  get 

Ey([(n+p+l)'1yj*lr2(yi  S)2S‘'2 ) 

= Ey(2ES[i2(2.5)2S'12]+(n-p-l)r!(2.fi)S'l2).  0.4.9) 

where 

Ey(D2[i2(2.a)2s'12i) 

= ey{  aM2.  £)]2  s-|2+i(rJ(2.  s)yTa^)TiT2  s‘2) 

= Ey{  nM2,  s))2  S'12+rJ(2,  S)d2(2  S''2) ) , (3.4.10) 

and 

d2(2s‘i2) 

= e&2  ]S'12+r(2T(ES)T)Ts'l)2+[(s-|)T2Ta^)T]T2 

- [<^i)ip]S*12+Q+t5«T(a'12)ip+is-,2] 


(e2)S-i2+iws-,2). 


(3.4.1 


Both  (3.4.10)  and  (3.4.11)  use  the  identities  in  Lemma  1 of  Haff  (1981).  We  have  ffiSy  1 
= (p+lWJp  from  (3.4.5)  and  [(£'1)TYT®2)TlTy  = 2',Itr(S.-,2)Ip*&'‘2j  from  Lemma  3 
of  Haff  (1982).  From  (3.4.8)-(3.4.1 1),  one  gets 
EaSrtr{rJ(2,S)£-1yg,£) 

- (n+p+i)-1 'Ejytri 2nM2, s)]2s''2 &&A2. s)trta‘2)Q‘v 

+(n+l)i2(S.S)S'12Q'y).  (3.4.12) 

Eaj-trlr^y.&trtS-'yjQ’y) 

= EaYtr(5-1y)tr{r2(2.5)Q-y) 

- Ejytri^iy,  5MQ'y)S'ly ) 

= Eaytrl^ty.^VtrtQ'm1),  (3.4.13) 

Eflytr{i2(y.s)a-‘yQ-y) 

= Efliir(^(2,sQ*ya-1y) 

. Eaytr(i2(2.s)Q‘yfi-,2)T 

= Eaj,tr(r2(2.S)25'lyQ-). 

Also 


(3.4.1 


Eyl^fS,  S)2)  = (n+p+ 1 )’ 1 Y-Ey  | 2[d5  i^(2.  SlS+B-^S. 8)1.  (3.4.15) 


as  in  (3.4.6)  by  replacing  r<2,  S)  with  r<2.  S). 

Combing  (3.4.12)-(3.4.15),  one  has 
EflYtr(r2(S.S)S'l2Q‘2) 

= (n+p+l^EjytrlZfn+p+Dttfir2^.  S)]2  S''2  ffi! 

+2[c2i2(y,  a)]Vn(Q'y)S'ly+nr2(y,  S)tr(Q*y)S'ly 
+2(n+l)|]^r2(2,S)]SSlyQ'y+n(n+l)r2(2.S)a-|yQ"y).  (3.4.16) 

Substituiing  (3.4.7)  and  (3.4.16)  into  (3.4.2)  to  get  (3.4.1),  the  proof  is  complete. 

Consider  the  particular  choice  r(2, 8)  = r(F),  where  F = tr(£''V).  Then  |Usr(V, 
S)]  =■  r'(F)[D?F)  = ffiKr&SJ]  - i'(F)|Ig£Fl,  and  0^rJ(2,  8H  = 

2r(F)  r'(F)[cS F]  where  ID^F]  = Q^tr(S''2)l  = -2‘*a2'1,  from  Lemma  6 of  Haff 
(1982).  We  have  the  following  theorem. 

Theorem  3.4,1  Let  r(2.  S)  = r(F),  where  F = trtS-'2>.  Then  estimators  of  the  form 
9EB(I,(2.  X)  = 0p-r(F)2 S* 1 )X dominate  X under  the  Lg-  loss  if 

(i)  0 < r(F)  < 2ro(K-p-l); 

60  r(F)  is  non-decreasing  in  F; 

(Hi)  is  n.n.d., 

where  r0  = (n+ptl)/|n+tr(Q)/(min  T)j)+1], 


T|(I^r2(F)][(n+p+l)Y£'1SQ‘y+ytrtQ'iOS'ly+<n»l)yS'1yQ‘Y]l 
= -2r(F).r'(F)tr|(n+p+l)Q+li<Q)ywyy-w-Kn+l)y'WV-,/2QV,ffly'lc) 


S 0.  (3.4.17) 

The  inequality  follows  from  the  assumption  (ii)  and  all  y , y and  Q are  p.d..  Also. 

tr(02Sr<F)]yg,y)  =tr(r,(F)y-'yQ-y)  =r'(F)tt<Q)2  0,  (3.4.18) 

since  |B^r(F)]  = r'fRIE^F]  = r'tRV'1.  r(F)  is  non-decreasing  in  Fand  Q is  p.d.. 

Using  (D^rfF)]  = -r'(F)y-‘sy'1  to  get  tr||^rtF)]yS'1VQ,y)  = tr{-r'(F)y-|V 
Q’y).  From  the  Wishart  identity 

EjjiyVff))  =Ey(2|l£r'(F)]+(n-p-l)y-V(F)).  (3.4.19) 


E2ir(i'<F)y-1yffy) 

= (n-p-l)-|Evtr(r'(F)Q-2Qwy1,Ja^r'(F)]yl/2Q1/2).  (3.4.20) 

Hence  form  (3.4.18)  and  (3.4.20),  one  has 

(n+p+i)',(K-P-  i)tr(  rt2,  a»ys'y  Q*y  )+tr(  icSr(y,  S)]y<2'y  I 

= [ 1 -((n+p+ 1 )(n-p- 1 ))- 1 (K-p- 1 )]r-(F)tr(Q) 

+2((n+p+l)(n-p-l))-|(K-p-l)tr{Qlflyw|I^r'(F)iyl/2Q1'2)  aO.  (3.4.21) 

since  (E3£r'(F)]  is  assumed  n.n.d„  and  n is  usually  larger  than  K and  r(F)  is  non- 
decreasing in  F.  From  (3.4.1),  (3.4.17)  and  (3.4.21),  it  follows  that 

Rg-(fl,flEB<l)>Rg-(fi,X) 


£ n(n+p+l)'2Egytr{(n+l)r2(y,£)S'1yQ*y+r2(y,  S)n((2*)QSf *¥ 


-2(n+P+i)(K-p-i)r(2,as''yQ,y).  0.4.22) 

Recall  that  (min  Rj)  is  ihc  smallest  eigen-value  of  Q and  S'1'2  jr1/2Qy.1/V'2  is  p.d.. 
Using  trtQ'Y)!,,  = trtffllj,  £ (n(QV(min  n,))Q.  one  gets  tr(tr<Q,iD  S.’ly)  - trt£'l/J 
yl«triQ)y‘«S-|/2)  £ trtS-|/2yl,2[tr(Q)/(min  tli)lQy.inS.‘l/2).  Thus  we  have  the 
following  inequality. 

Rg-fa.  a^-Rg-ca.  X) 

£ n(n+p+l)'JEgytr([(n+tr(QV(min tlO+Ur^Y. £)-2<n+p+l)(K-p-l)r(Y. S)1 

.£.|/2yi/2Qyl/2a-i/2,  (3.4.23) 

Then  the  result  is  a consequence  of  assumption  (i).  The  proof  is  complete. 

Remark  3.4.1  Suppose  r(V.  5) - a is  constant.  In  view  of  from  0-4.23). 

Rg-(a.fiEB<l)>-Rg-(&.X) 

= n(n+p+l)-|r^Eastr((a2-2r0(K-p-l)a]S-',2Y1/2QVI'2S'IC).  (3.4.24) 

It  can  be  shown  that  the  best  value  of  a is  ro(K-p- 1 ) and  the  corresponding  estimator  is  the 
matrix  form  of  Jamcs-Stein  estimator  under  the  Lg-  loss  and  V complete  unknown,  that  is, 
Op-roOC-p-uya-1)*. 

Next,  we  consider  the  family  of  estimators  of  the  form  (P®  defined  in  0-1. 12). 
The  following  corollaty  provides  the  risk  for?8®  under  the  Lg*  loss  in  (3.1.1).  The  risk 
is  denoted  by  Rg*(&,  S88®). 


Corollary  3.4.2  Under  ihe  Lg-  loss,  ?*W(2,  20  = (Ip- 2 £'‘a(S))2£  has  risk 


V<a,  aEB(2)) 

■ Rq-O,  20  +n(n+p+l)‘2  E2ytr(  [£'1!i>(S£ai(S)£'l-4(n+p+l 

- 2(n+p+l)(K-p-l)a(S)S',]Y1/2QY1/2+S'lll!(£)5a(S)S'lir<Q)Y)  - (3.4.25) 

Proof  of  Corollary  3.4.2  With  the  change  of  1(2.  S)  with  ffl(S)  in  (3.1.3),  we  obtain 
RQ-(a.fiP®) 

= Rg-(S.  20  + Eg  Yrr  ( S’  'a(S)S  ffi(S)£‘ 1 2 O'  «(£)£' 1 1 2 <TY 

-2(K-p-I)!b(S)S-|2Q'Y)-  (3.4.26) 

Since  V - iV(p.  n,  (n+p+l)_lV),  by  applying  Lemma  1.3.3  twice  to  get 

Ey[2  Q*2j  = n(n+p+l)‘2((n+l)YQ*Y+trQ;Q,)iQ.  (3.4.27) 

Also  Ey(2l  = n(n+p+l)‘lX 

In  view  of  (3.4.26).  one  has 

Rfi-ffl.?8®) 

= Rq-(0.  20  + n(n+p+l)-2Eflytr(a-1sKa)affi(S)5'll(n+l)YQ’Y+trQ'Q')Y] 

-4(n4p+l)E5(Sat(£)£'1lYQ’Y-  2(n+p+l)(K-p-l)ffl(S)£-1YQ-y).  (3.4.28) 

Equation  (3.4.25)  is  a direct  consequence  of  (3.4.27)  and  Q*  = V',/2Q  yw.  The  proof  is 
complete. 


Note  trtQ)i  fi  (tr(Q)/(min  ni))3£l/203£l W.  The  risk  differemce  of  Sp®  and  X is 


Rg-(a,aEE0>>RQ-(a,x) 

S n(n+p+  1)'J  Eg  ytr  ( [(n+tr<Q)/<niin  T\j)+ 1 )£'  ‘ffi(S)S  aKS>S' 1 
-4(n+p+l)I2^(SiB(S)S'll-2(n+p+lXK-p-l)!K(SJS'lly,,,Qil/2) 

= n(n-tp-fl)-1r0lEeYir((S-1B(S)Ss[!(5)S'l-4roESfSa<S)S‘l] 

-2r0(K-p-l)4|KS>£1]V,/2QY.l/2!,  (3.4.29) 

where  tq  = (n+p+1  )/(n+tr(Q)/(nun  T)i)+1). 

As  before,  we  discuss  die  particular  choice  of  ffi(£)  = EIE(1)ET.  Then  Zheng's 
(1986)  Theorem  2 is  extended  in  the  next  theorem  to  the  situation  that  V is  complete 
unknown  and  the  loss  function  is  Lq'. 

Theorem  3.4,2  Estimators  of  the  form  Sp®(2,  X)  = (Ip- 5 S^ESO)  ET)X  dominate  X 
under  the  Lg*  loss  if 

(i)  0 <4'i(i)<2r0(K-p-l)  for  each  i = l,...,p; 

(ii)  34'i(i)/3Xi  a 0 for  each  i = 1 p; 

(in)  T.(A)'s  are  similarly  ordered  with  the  Xj’s,  that  is, 

(4'ia)-H'ja))(Ai-Xp  2 0 for  every  1 £ i.  j S p, 
where  r0  = (n+p+l)/|n+ir(Q)/(min  T|j)+lJ. 

Piaof  Of.niMrem  3,4,2  Sec  the  proof  of  Theorem  3.3.2. 
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Remark  3.4.2  AU  the  results  in  Remark  3.3.3  still  hold  for  iP®<2. 20  = Op-2  S'1 
ES&JE^X  by  the  obvious  replacement  of  new  r„  = <n+p+l)/[n+tr(Q)/<min  Hi>+1]  and 
2,  with  2- 

The  improvement  offiEB(3)(2, 20  defined  in  (3.1.13)  over  aEB<1,(2. 20  of  the 
form  (3.1.11)  is  considered  in  the  rest  of  this  section.  First,  Theorem  3.1.2  is  extended 
based  on  the  assumptions  2 ~ W(p,  n,  (n+p+lf'V)  and  r(  V,  S)  = a is  constant 

It  follows  from  (3.1.14)  and  (3.4.27),  the  risk  difference  of  Sp<3|(V,  20  * 
(Ip'*2S’1-2S"'f(S))X  and  SEB<1>(2.  X)  - (Ip-a2S'‘)X  under  the  Lg-  loss  is 

n(n+p+l )%  Ytr(ECE2(i)A' 1 t-2a£(i)A''lETl(n+ 1 U Q’V-htQ!  Q')V) 
-(n+p+l)EI4^‘>(i)+2(K-p-l)±QJAl]ETyQ'y) 

S n(n+p+l)'lrQEjytr(CEJQOA'l-2(ro(K-p-l)-a)!E(DA'l-4ro!E(2i)A"ll 

•ETylflQyl^)  (3.4.30) 

To  show  (Ip-a2  S'l-2S‘lf(S))X  dominates  (lp-a2  S'')X  under  the  Lg-  loss  is  suffices  to 
show  (3.4.30)  < 0 or  A3i(D  < 0 for  each  i = 1 p,  where 

A3i(D  = 'P?(i)X;1-2[r0(K-p.l).a]4'i(DX;l-2ro 

-4r0I3'Pi(i)/aXi).  (3.4.31) 

Then  the  following  theorem  provides  solutions  to  A3i(i)  < 0 for  each  i = 1 p. 


Theorem  3.4.3  Lei  0 < a < ro(K-2)  where  r(j  - (n+p+l)/[n+lr(Q)/(niin  Tlj)+1].  Suppose 

TiO)  = X,t(U)/(iXj),  i = I p,  where  U is  a function  of  the  Vs.  Up-aSi'1- 

|t(U)/tr(S)l2)X  dominates  (Ip-aYS‘')X  under  the  Lq-  loss  if 

(i)  U(D  is  non-decreasing  in  X*  for  each  i = 1 p; 

fti)  t(U)  is  non-decreasing  in  U; 
fni)  0 < t(U)  < 2[r0(K-2)-a]. 

Remark  3.4.3  In  view  of  Remark  3.4.1,  the  matrix  extension  of  the  James-Siein 
estimator  (Ip-ro(K-p-l)Y  £‘')X  is  the  best  estimator  of  the  form  (Ip-aY  £‘')X.  although 
(Ip-ro(K-p-l)y  S’1-(t(U)/tr<S)]y)X  dominates  (Ip-ro(K-p-l)y  S'')X  under  the  Lg-  loss  if 
0 < t(U)  < 2ro(p- 1 ).  If  t(U)  = b is  another  constant,  then  the  optimal  choice  of  b is  ro(p- 1 ). 


To  illustrate  the  the  estimation  results  derived  in  the  previous  sections,  we  compare 
the  simulated  risks  of  five  different  estimators  of  fi  under  the  three  different  assumptions  of 
V.  given  in  (3.1.8)-(3.1.10).  These  estimators  are  X.  and  XJ  = 
Op-  YS‘‘l(j)<Y.  £)>X(j-1.2.3, 4.)  where 

Hl)<2.S)"  r0( (K-p- 1 ilp+Kp- 1 )/tr(S)|S  );  (3.5.1) 

I(2|<2,  £)  - r0{ (K-p- 1 )lp+((p- 1 )/tr(S2)lS2 1 ; (3.5.2) 

I(3)<2,  £)  - roKK-p-Oy;  (3.5.3) 


I(4)(2.£)  = ( l-ao){  (K-p- 1 Jlp+tfp- 1 )/tr(S)l£l  tool  [(Kp-2)/tr(S)lSJ , (3.5.4 


where  do  = (K-p-l)/[K2(p-l)]  and  r0  - 1 under  the  assumptions  of  (3.1.8)  and  (3.1.9). 
and  Oo  = (p-l)(K-p-l)/l(K-2)-(Kp-2)rQ']2  and  r0  = (n+p+iyln+tr(Q)/(min under 

the  assumption  of  (3.1.10). 

Throught,  we  have  considered  the  loss  (3.1.1).  As  in  Section  2.4,  ihe  (K,  p) 
combinations  chosen  are  (4,  2),  (8,  2),  (12.  2).  (5.  3).  (9.  3)  and  (13, 3).  For  each 
combination.  1000  samples  are  generaicd  from  Npfflj.  V).  Also  1000  samples  of  2 are 
generated  independently  according  to  the  assumption  of  V.  The  constant  n.  degrees  of 
freedom  of  the  chi-square  distribution  and  the  Wishan  distribution,  has  been  chosen  as  3 
times  K.  Also,  it  is  assumed  that  v - v(n+2)-1  yjj  with  v = 1 in  the  fust  case. 

For  p = 2.  we  have  taken  8|  = Qor0i  = (10, 35)T  for  each  i = 1 K.  For  each  p 

= 3,  our  choice  is  Si  = Q or  flj  = (6.  10.  50)T  for  each  i = 1 K.  Corresponding  to  a 

given  situation,  for  each  (K,  2)  pair,  there  are  four  (Q,  V)  combinations  with  each  Q or  V 
equal  to  Ij  or  = diag(0.0 1 . 1 .00).  For  each  (K,  3)  pair,  there  are  once  again  four  (Q,  V) 

combinations  with  each  Q or  V equal  to  I3  or  M3  = diag(0.01 . 1 .00. 100.00).  Thus,  in  this 
situation,  for  each  (K.  p,  Q.y.  Bj),  the  simulated  risk  is  the  average  of  1000  expected 
losses.  There  are  four  tables  for  each  of  the  three  situations  of  covariance  matrix.  In  each  of 
them,  the  first  two  tables  report  the  simulated  risks  for  fij  = Q (pxl)  while  the  last  two 
report  the  same  when  8j  = (10, 35)T  and  Si  = (6. 10, 50)T,  respectively. 

estimator,  shrinkage  estimators  of  matrix  noimal  means  perform  the  best  when  the  true  Bi 
equals  the  point  towards  which  we  want  to  shrink  (in  this  case  B (pxK)).  As  the  true  Bi 

f covariance  matrix  goes  from  (3.1.8)  to  (3.1.9).  and  decreases  as  the 


the  oiher  i 


: held  fixed. 


Another  point  to  note  is  that  for  fixed  p,  Q,  and  y,  the  risk  improvement  of  each  estimator 
increases  over  K.  The  improvement  from  K - p+2  to  K = p+6  is  much  more  pronounced 
than  the  one  from  K = p«>  to  K = p+IO.  Also,  it  may  be  noted  that  8^,  Sp,  and  0^  all 
outperform  8^  in  terms  of  risk  improvement  when  the  true  8,  values  are  zeros,  although 
for  Si's  far  away  from  zeros,  all  the  four  estimators  are  nearly  at  par.  Finally,  we  observe 
that  8^  has  a consistent  edge  over  the  remaining  estimators  when  Si's  arc  zeros,  while  8(4) 
performs  as  well  as  the  others  when  Bj's  are  far  away  from  zeros.  Thus,  again,  we 
recommend  the  use  of  0^  in  the  present  context. 
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CHAPTER4 

EMPIRICAL  BAYES  MINIMAX  ESTIMATORS  OF  MATRIX  NORMAL  MEANS 
FOR  ARBITRARY  QUADRATIC  LOSS  AND  KNOWN  COVARIANCE  MATRIX 
IN  THE  TWO-SAMPLE  PROBLEM 


We  consider  the  problem  of  estimation  of  one  of  the  two  sets  of  matrix  normal 
means  when  it  is  suspected  that  the  two  matrix  notmal  means  ate  equal.  Let  X[.  and  be 
independent  p-dimcnsional  column  vectors  (i  - I.  ....  K.  K > p+1).  where  Xu's  are 
independently  distributed  as  Np(fl||,  Y).  while  JJjj's  are  independently  distributed  as 
Np(fi21.Y).thatis. 

2£hil2hi-Np(fihi.y).  (4.1.1) 

h = 1, 2;  i = 1. ....  K.  In  the  above,  are  unknown,  but  V is  a known  pxp  symmetric 
p.d.  matrix.  Let  2*  = (fin, 2t,K).  = Ogi 2^hK>-  h - 1.  2.  Our  goal  is  to 

Chapter  5.  Such  problem  can  appear  in  a balanced  two-way  MANOVA  model 

Xhij-fiN+Shij-  (4-1.2) 

h = 1, 2;  i = 1. ....  K;  j = 1, ....  m.  where  Xgy  is  a pxl  vector  of  responses  of  the  jth 
individual  in  hth  block  receiving  the  ith  treatment,  and  Sw  is  the  pxl  vector  of  ith  treatment 
and  hth  block  effect  combination.  The  vectors  say's  are  independendy  distributed  as 
Np(&  Yo).  where  Yo  is  a known  pxp  symmetric  p.d.  matrix.  The  minimal  sufficient 

statistic  for  fia  = (fia, SaK)  isXa  = (Xa, XaK>.  where  Xai  = m'1 1 Xay.  h - 1. 

2.  The  Xai's  are  now  independendy  distributed  as  Np(Saj,  Y).  where  Y - m‘lYo  ttnd  K > 
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erence  in  block  effects  (Qq  = 62;,  i = 1 K), 

in  which  cose,  observations  in  second  block  can  possibly  be  used  effectively  together  with 
those  in  fast  block  for  estimating  the  population  means  of  the  first  block.  Thus,  the 

one  or  the  pooled  sample  mean  (Xn+XalA  from  both  blocks.  One  natural  way  to  settle 
this  problem  is  to  test  the  null  hypothesis  of  the  equality  of  the  two  block  effects,  and  then 
use  a compromise  estimator  which  leans  more  towards  the  sample  mean  from  block  one  if 
the  null  hypothesis  is  rejected,  and  use  the  pooled  sample  mean  when  such  a hypothesis  is 

estimator  (PTE)  which  uses  the  pooled  sample  mean  when  the  null  hypothesis  is  accepted 
at  a desired  level  of  significance  and  uses  the  sample  mean  from  block  one  if  the  opposite  is 

incorporating  the  degree  of  evidence  for  or  against  the  null  hypothesis  in  order  to  arrive  at 

In  this  chapter,  we  propose  instead  three  classes  of  EB  estimators  which  achieve  the 
intended  compromise.  Such  an  EB  estimator  is  quite  often  a weighted  average  of  the  pooled 
sample  mean  and  the  first  sample  mean.  The  weights  are  adaptively  determined  from  the 
data  in  such  a way  that  the  larger  the  value  of  the  statistic  used  for  testing  the  equality  of  the 
two  block  effects,  the  smaller  is  the  weight  attached  to  the  pooled  sample  mean.  Thus, 
unlike  the  PTE,  the  EB  estimator  incorporates  the  degree  of  evidence  for  or  against  the  null 
hypothesis  in  a very  natural  way.  Furthermore,  we  provide  modified  versions  of  the  EB 
estimators  which  still  are  weighted  averages  of  the  pooled  sample  mean  and  the  first  sample 

estimators  depending  on  the  acceptance  or  rejection  of  the  hypothesis.  Such  estimators 
continue  to  incorporate  the  degree  of  evidence  for  or  against  the  null  hypothesis  in  a very 
natural  way  and  arc  shown  to  be  better  than  the  PTE  in  ftequentist  sense. 
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In  Older  to  motivate  the  EB  estimator,  we  need  a Bayes  procedure.  The  distribution 
in  (4.1.1)  is  treated  as  conditional  distribution  given  fit,,  in  Bayesian  framework.  Suppose 
also  that  fiy’s  have  independent  distributions 

fihi~Np(Ui.A).  (4.1.3) 

where  Uj's  arc  pxl  column  vectors  and  A is  a pxp  symmetric  p.d.  matrix.  The  suspicion 
that  Qn  and  fij]  may  be  equal  is  reflected  in  the  choice  of  a priori  common  mean  tir  Note 
that  conditional  on  fin's  and  fi^'s,  Xu's  and  £21 's  are  mutually  independent  Also,  fin's 
and  Sjj's  have  independent  normal  priors,  standard  calculations  show  that  fin’s  and  fi^'s 
given  Xii’s  and  X2i’s  have  mutually  independent  posterior  distributions 

am  i Xu  - sin  - Np(0p-Yr  WYE-'*,  dp-yr'm,  (4.1.4) 

where  £ = (V+A)-  Now,  using  the  loss  function 

Lglfii.il)  = 1 (fiirai)TQ(fiirSi)  (4.14) 

for  estimating  fi(,  where  Q is  again  a known  pxp  p.d.  matrix.  The  Bayes  estimator  of  fli  is 

fifQO  = (Ip-XZ'lXi+YZ'll.  (4.1.6) 

where  X = (X|,  X2)  and  u = (Hi Uk)-  1"  order  io  find  an  EB  estimator  of  fii,  we 

of  Xu's  and  X2i's-  Note  that  marginally  Xhi's  arc  mutually  independent  with 

2£w~Np(Ui.£).  (4.1.7) 

Then  (2,  S)  is  complete  sufficient  for  (u,  £)  base  on  the  marginal  distributions  where  £ = 
(Zi Zk>.  Zi  = <X.ii+Xa>/2  ~ Np(Hi,  2-'£).  Xi  = 2(X|,-Zi)  - Np(fi.  2£)  and  £ = 
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r\zx$)  ~ lV(p,  K, 2).  Hence,  the  UMVUE  of  u is Z end  the  besl  multiple  estimator 

of  £'1  is  (K-p-i)S'1-  Substituting  these  estimators  for  |i  and  2"1  in  (4.1.6),  one  gets  the 
EB  estimator  ap-(K-p-l)YS'1)Zt+(K-p-l)YS'1Zof  fl,.  A general  class  of  EB  estimators 
of  &i  is  given  by 

af*(X)  = (Ip-  YS''l(Y.  S)«i+YS‘1lO£.  S)Z,  (4.1.8) 

where  I(Y.  5)  is  a pxp  symmetric  matrix.  As  in  Chapter  2,  we  prefer  this  notadon  even  if 
Y is  known,  since  later  in  the  case  of  unknown  Y,  we  will  use  KY,  S),  where  Y estimates 

y. 

(4.1.8).  It  is  well  known  that  under  the  loss  given  in  (4.1.5),  2£t  is  a minimax  estimator  of 
fi,  with  constant  risk  KWYQ)-  We  first  find  the  risk  difference  of  fl™  and  X,.  and  then 

repeating  the  materials  of  Chapter  2.  Section  4.3  provides  modified  versions  of  these  EB 
rainimax  estimators  as  proposed  in  Section  4.2,  and  later  are  shown  to  dominate  the  PTE. 

the  problem  of  estimating  a vector  of  normal  means  in  the  two-sample  situation  and 
proposed  both  EB  and  HB  estimators  which  dominated  the  first  sample  mean. 

The  problem  can  be  extended  to  multi-sample  case  (h  = 1 H,  H > 2),  in  which 

case,  several  matrix  normal  means  are  suspected  to  be  equal,  and  observations  in  all  the 
samples  can  be  combined  effectively  to  improve  estimation  of  a matrix  of  normal  means. 
This  is  discussed  in  Remark  4.2.1. 


We  Fira  compute  the  risk  of  the  estimator  as  defined  in  (4.1.8).  Throughout 
this  chapter,  we  denote  by  Eg,  the  expectation  conditional  on  fi  = (fij,  ih).  E the 
expectation  over  the  marginal  distribution  of  & and  E the  expectation  over  the  joint 
distribution  of  X = (X,.  X2)  and  & Also,  let  Rg(Si.  sQO)  = Eg[Lg(2i.  £(X)».  where  Lq 
is  defined  in  (4.1.5). 

Theorem  4,2,1  Under  the  Lg  loss  given  in  (4.1.5).  has  risk 
Rgffll.af) 

- Rqlfii.  Xi)  4 Egtr{  [S  'ltX.  S)Sl (Y.  SIS’1  -^SKY,  SIS'1] 

- 2(K-p-l)iQ'J  S)S'‘]Qo].  (4.2.1) 

where  Rgffl,.  X,)  = Ktr(Y  Q),  Q0  = V QV  and  D5  is  defined  in  Lemma  1.3.3. 

Proof  of  Theorem  4 2.1  Let  fi®(X)  = (Ip-X£'1)Xii+y.X',Ui  and  Iff(X)  = dp-  Yi'1 
KY,  S))Xii+YS''i(V,  S)Zi.  i = 1 K.  Standard  calculation  yields 


= ELg<a, . af ) + e(i  (fi®-aff)TQ(fiH  -aff> ) . (4.2.2) 

where 


Y(S''l(Y,  S)-£'l)(Xii-4)-Y£'l(4-Ui) 
I Y(S'‘l(Y,  S)-2r')Yi-YZ'l(2i-Ui). 


*.2.3) 
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Next  use  the  independence  of  Z,  and  Yi  (and  S)  and  the  fact  Zi  ~ Np(Uj,  2'IZ),  one  gets 
E(i(af1-ajf)TQ(afl-aff)l 
= E(2(4-Ut)VYQy£'l(Z,-Ui) 

+jii,T<a''ia  £)-£'i)tyqy(S'iky,  a-jr1)*,) 

= Etr((S(Zi-UiXS-Ui)T)2rlQo2'1 

*j(a-‘i(Y.  a-r'xj  i yp£><it<y,  ay'-s4)^} 

= ftr<rlQo)^Etr((a-,i(Y.a-rl)S(iT(Y,as-|-r1)Qo)-  <4-2-4> 

Applying  the  same  argument  to  obtain  (2.2.4)  and  the  fact  E(a  = K£,  it  follows  that  the 

5Etr{[s-,i(Yasi(Y.aa,-2jr1si(Y.as‘l]a,)+f“<£',Qo)-  («-s> 

From  (4.2.4)  and  (4.15),  it  foUows  that 

E(!(5fi^TQfflf|-ttff)) 

= Ktrffi-1Q0)^Etr{(5-|i(Y.asi(Y.as‘,-2jr1sKY.aal](i)).  (4.2.6) 

Note  from  (4.1.4)  and  (4.1.6)  that 

ELgiai.afi'KtriOp-Yir'aQ).  (4.2.7) 


1 (4.2.2).  (4.2.6)  and  (4.2.7), 
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Emai.fif) 

= KnO'ft)-^En-([5-|l(y.S)Sl(y,S)S'l-2£-|Sia,S)S’')a))-  (4.2.8) 

Next  apply  the  Wishan  identity  as  given  in  Lemma  1.3.3,  and  get 

E[£-lai(Y.S)S‘ll  =E(2CS[£l(Y.S)S'']*(K-p-l)Iffl.S)S-ll.  (4.2.9) 

Note  that  Rgffl,,  X|)  = ELq(2|.  X,)  « Ktr(V  Q)  for  all  0 Hence  we  find 

HLgffli.Sf) 

= ERg(ai,af) 

= ERgOi,  XO+EEjjtrl  [£'‘i(Y,  S)SllY.  SC'1  -4^)SHM,  S)S-1] 

-2(K-p-l)KXS)£'l]Qo)-  (4.2.10) 

The  conditional  distribution  of  0 = 2"1  £ XjXj  given  fi  depends  on  & only  through 
2-‘  I(flli-a2J)(flu-fl2i)T  - a'(say).  Also  the  family  of  distributions  of  fi  being  Wishan 
lV(p,  K,  A)  is  complete  for  A (or  £).  The  proof  of  the  theorem  is  complete  from  (4.2.10). 

In  view  of  (4.2.1),  an  unbiased  estimator  of  the  risk  difference  Rg(&i , fif3)- 
Rg(fli,Xi)U  given  by 

5 frlQj'VlO.  S)SlO.  SIS'1  -4dS(Si(Y,  SIS'1] 
-2(K-p-l)i(y,S)S'1]Qo/2).  <4 


4.2.11) 


■Qo  = QW-if’ 


KY.S) 


where  and  is  negative  with  positive  probability  for  all  fi,  then  we  obtain  EB  estimator  g^8 
which  dominates  X|  under  the  Lg  loss  and  is  minimax  afterward.  Notice  the  analogy 
between  (4.2. 11)  and  the  unbiased  estimator  of  the  risk  difference  Rg(g,  SpHtyg.  20 
obtained  from  (2.2.1)  in  one  sample  problem.  In  both  cases.  V,  g and  I(Y,  £)  are  pxp 
symmetric  p.d.  matrices.  We  can  proceed  exaedy  the  same  way  in  Section  2.2  to  find  two 
classes  of  EB  minimax  estimators  ' 1 and  g^8®  which  dominate  X]  under  the  Lg  loss. 

They  ate  defined  as  following: 

gf^QD  = dp-rtV.  5)YS'')Xrn(Y,  S)YS'’Z,  (4.2.12) 

where  rtV,  £)  (>  0)  is  a scalar,  and 

gf^'ffi)  = (ip-Y  S' 1 it(S))X  t +¥  S'  'a(S)Z,  (4.2.13) 

where  ffi(£)  is  a pxp  symmetric  p.d.  matrix.  Also  let 

gf^QO  = (Ip-nt S)yS''-VS''f(S))XrKrQ!, S>yS'l-*yS‘lf(S))Z,  (4.2.14) 

where  fig)  is  a pxp  symmetric  p.d.  matrix.  It  can  be  shown  that  gf0131  beats  g^8*1’  under 
Lg  loss  and  under  certain  conditions  on  r (V,  S)  and  f(S).  As  in  Section  2.3.  both  fi(£)  and 
f(S)  arc  of  the  form  E»DET.  where  £&)  is  a diagonal  matrix  with  ilh  diagonal  clement 
equal  to  VfiJ  > 0,  i = 1 p,  X = (2-, Xp)T  denotes  the  vector  of  eigen-values  of  S, 

vectors  of  S (S  = E AET.  A = diag(Xi Xp)). 

Remark  4.2.1  Suppose  there  are  H (>  2)  groups  of  sample  matrix  normal  means  Xti  - 
h = 1, ....  H.  It  is  desired  to  estimate  g]  - ^g)  on  the  basis  of 


(Xhl XhK>.l 


92 


Xh's  since  they  arc  suspected  10  have  die  same  populadon  mania  means  fi|.  In  this  case, 
we  define  Zj  = H'1  EXw.  Xj  = H(Xu-Zj)  and  S = [Hffl-l)]'1  ZXJ*.  where  marginally 

Zj  - NpOij,  H''B,  Xi  ~ Np(Q,  H(H-1)B  and  S~  W(p,  K,  B-  It  can  be  shown  that  the 
difference  of  risks  of  fi®  and  Xt  is 

(H-l)H-‘Efitr(  [S' 'icy,  SISKY,  SIS'1  -'>Ds[Sl(Y,  SIS'1] 

-2(K-p-l)J(Y,  SJS'lQo)-  (4.2.15) 

The  multiplier  (H-  1)H''  does  not  affect  the  way  of  finding  EB  minimax  estimators  af®01. 
Sf01  and  af*01  defined  in  (4.2.12)-(4-2.14),  respectively. 


In  this  section  we  show  how  a modified  EB  estimator  can  dominate  the  PTE  under 
the  Lg  loss.  A general  PTE  of  Si  is  given  by 

af^xi  = g(cixi+(i-g(ciiz 

= IIp-(l-g(C))Ip]Xl+<l-g(C))Z,  (4.3.1) 

where  g(Q  = C = 2'1  X X?v;%  = tr(Y''SI,  d being  a positive  constant  depending 

hypothesis  of  Bt  * 82.  C is  distributed  as  yj^.  We  propose  the  modified  EB  estimator 


“<xi  = graaf+d-gioiz 

= (Ip-[(l-g(Q)Ip+g(C)YS'lJb(Y.  SIHXi 


-K(l-g(C))Ip+g(CM£S‘,Io(¥.  5)12. 


*.3.2) 


where  T(/V.  S)  is  a pxp  symmetric  p.d.  matrix.  Note  that  af188  estimates  Si  by  Z if  g(Q 
= 0,  that  is,  the  null  hypothesis  is  accepted  at  a desired  level  of  significance,  and  it 
estimates  Bi  by  an  EB  estimator  Sf8  when  the  opposite  is  true.  Also,  aj'188  continues  to  be 
a weighted  average  of  the  pooled  sample  mean  and  the  first  sample  mean  which  reflects  the 
incorporation  of  degree  of  evidence  for  or  against  the  null  hypothesis.  The  following 


Theorem  4.3,1  Consider  the  Lg  loss,  the  difference  of  risks  between  Sf188  and  af18  is 
given  by 


RgGi.af^Rgfai.an8) 

5Efi(g(C)[S-|Jo(Y.S)Sloa  SIS'1  -AD^iofY,  SIS'1] 


- 2(K-p- 1 )lo(Y,  £)S' 1 ] Qo  ] ■ (4.3.3) 

Proof  of  Theorem  4.3.1  Write  l,(y,  S)  = (l-gCCMSy1  and  ijtyS)  = (l-g(C))Sy-' 

+g(Oio(y.  si = iicy,  s)+g(c>io(y.  s).  Then  = up-ys'1ii(y,  siixi+ys'1 

IlfV,  S)Z  whUe  a^B(X)  = (Ip-Y S''l2(y,  S))Xt+ya‘lU(y.  S)Z-  Note  i,<y.  S)  and 
S)  are  not  symmetric.  Let  aff^QD  = dp-y S‘'uQ£.  S))Xii+yS''j2(y.  S)Zj  and 
a^UO  - 0,,-y S-'n(y  S))Xu+yS‘‘li(y  S)Zi.  Then,  applying  (4.2.2)-(4.2.4>  twice, 
once  with  tfy  S)  = I,<y  S).  and  next  with  i(V,  S)  = I2<y.  S),  one  gets 


ELgta,,  al^j-ELgfai,  af18) 

= 5Htr|[<S'll2(y.5)-£'')S(lT(y.  SJS'-S'1) 


-(S'liia.s)-jr')S(i]'(y.s)a'1-r‘)]Qo)  • 


(4.3.4) 


Since  fcti.  £)  = I,(V,  5)+g(C)lo(X,  S)  and  Iq CL  S)  is  symmetric,  we  rewrite  the  right- 
hand  side  of  (4.3.4)  as 

5 Etr|  [g2(os'1^o',s)sioffl.  s)S'l+g(C)a-|ii(v,  asioty.  as-1 
+g(C)S‘lioQ(.  S)Sl{ (V,  £)S‘l-g(C)£'llo<Y,  Sfii'1 
-g(C)2TlSlo(Y,S)S'l]Qol  (4-3.5) 

Also.  g(Qi,(Y,  a - g(Q(l-g(C))5r 1 =■  0,  g2(Q  = g(C),  and  triS^IoCY,  SJSZ'Qo)  - 
trarlSloQ£.£)S‘lQo).  Hence, 

= j Etr|  [g(C)£'  'loGC,  S)S  loCy,  S)S‘ 1 -22T 1 g(C)S  lo<y,  SB’ 1 ]Qo ) ■ (4.3.6) 

Since  marginally  S - Wlp,  K.  2),  apply  the  Wishart  identity  as  given  in  Lemma  1.3.3  to 
g« 

E{r1[g(C)£l0(Y.S)S'1]) 

- E ( 212s  (g(QS  lo(Y,  £)S‘ 1 1 -KK-p- 1 )S‘ 1 Ig(C)Sio(Y,  S)S' 1 ] ) 

- E(2[ES  glQlSioiy.  S)5',+2[(g(C)lI1)T(D5)T]T[5loa.  5)i'') 

+(K-p-l)[g(C)lo(y.S)S'1)).  (4.3.7) 

where  DS  g(C)  = (@g(CV9sjj».  3g(C)/3sij  = 0 a.e..  Thus,  from  (4.3.6)  and  (4.3.7), 


ELQlfli.a^-EL^^.Sf1 
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= i(nStti.^*B)-RQ(fti.sriB)j 

= iEEfl(g(C)is'lioa.s)Sio(i!.s)a’l-4Esisioas)S'1] 

- 2(K-p-l)i0(y,  £)S‘']Qo)  ■ <4-3-8> 

Again,  by  using  ihc  completeness  argument  used  in  ihe  proof  of  Theorem  4.2.1,  (4.3.3)  is 
the  direct  consequence  of  (4.3.8).  The  proof  of  the  theorem  is  complete. 

The  above  theorem  provides  an  unbiased  estimator  of  the  risk  difference  of  a 
modified  EB  estimator  and  the  PTE.  which  can  be  recognized  as  g(C)  times  the  unbiased 
estimator  of  the  risk  difference  of  the  corresponding  EB  estimator  and  Si  by  examining 
Theorem  42,1.  Note  that  Pr(g(Q  > 0)  = Pr(g(C)  - 1)  > 0.  Hence  the  EB  minimax 
estimators  af8*0.  Sf8®  and  flj8®  as  given  in  (4.2.12),  (4.2.13)  and  (4.2.14),  can  be 
modified  to  get  g}*B(,>,  fl}®8®  and  1““™  respectively  and  they  all  dominate  the  PTE 

aMBW’oo 

- Ilp-K 1 -gOlp+gfCMY,  S)Y  S' 1 1 IXi*-[(  l-g(Q)Ip+g(C)r(X.  m S' 1 XL  (4.3.9) 

- ( Ip-K  1 -g<0)Ip+g<C)^  S'  ‘SBISJ] ) 2C| +10  -B(C»Ip+8<C)^  S’  'SB(S)1Z,  (4.3.10) 


gMEBC!)^ 


(iP-((i-g(Q)iI1+g(c)ras)ys'l+g(C)ys'lf(S)]]Si 


K(  1 -g(C))Ip+g(OiO'.  S)Y  £' 1 +g(QY  £'  'f(S)]Z. 


EMPIRICAL  BAYES  MINIMAX  ESTIMATORS  OF  MATRIX  NORMAL  MEANS 
FOR  ARBITRARY  QUADRATIC  LOSS  AND  UNKNOWN  COVARIANCE  MATRIX 
IN  THE  TWO-SAMPLE  PROBLEM 


5.1  Introduction 

investigated  the  estimation  of  one  of  the  two  sets  of  matrix  normal  means  when  it  is 
suspected  that  the  two  matrix  normal  means  are  equal.  In  many  practical  situations,  V is 
often  unknown  and  needs  to  estimated  from  the  supplementary  data.  Suppose  v,  a 
symmetric  matrix  independent  of  X = (Xt . X2).  is  available  as  an  estimate  of  V.  This  is  the 

(4. 1 .3),  but  relax  the  assumption  of  known  Y to  the  following  three  assumptions  about  Y 
with  a complete  sufficient  statistic  Y independent  of  X B (Xt-  X2L 

(i)  Y = vQ,  Q known,  symmetric  and  p.d.,  and  v > 0 unknown, 

Y = vG,v~v(n+2)-'3£j,  (5.1.1) 

where  is  chi-square  distribution  with  n degrees  of  freedom; 

(ii)  Y-Yd  = diag(V|, ....  Vp),  a diagonal  matrix  with  ith  diagonal  element  v,  > 0 
and  all  vj's  unknown. 

Vd=diag(v, Vp).  V|  - VjOi+a'1^.  independently;  (5.1.2) 

(iii)  Y symmetric  p.d.  and  entirely  unknown.  With 


Y ~ (n+p+ir'Wfp,  n.  Y).  n > p+1. 


(5.1.3) 


(5.1.4) 
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In  such  cases  the  estimators  of  0|  are  compared  on  the  basis  of  general  quadratic  1 

Lg-ltti.  a)  - I (flir»i)TQ’(ttir*i). 

where  Q*  = Y'1/2QY'1/2.  and  again  Q is  a known  pxp  p.d.  matrix.  The 

multiplier  Y"l/J  used  in  the  loss  function  because  that  makes  Xi  a minimax  estimator  of  Bi 
with  the  constant  risk  Ktr(Q)  not  depending  on  any  unknown  parameter. 

Next,  we  illustrate  the  availability  of  a complete  sufficient  statistic  2 from  the 
supplementary  data  for  estimating  the  unknown  Y in  a balanced  two-way  MANOVA  model 
given  in  (4.1.2).  First,  suppose  Yo  = vS-  only  v is  unknown  and  G is  p.d..  Let  2 = vG. 

where  v = (m(2Kp(m-l)+2)]"ltr(  I X 1 (Xhij-2ihi)£j-' ‘ (21hij-Xj,i)T ) - Recall  Xu  = 
h=l  i=l  j=l 

m"1  x 2£j,ij.  Thus.  Xu ' Shi  ~ Np(ehi.  m"'vG)  and  v - v[m(2Kp(m-l)+2)r'x^Kp(m.i).  If 
>■1 

Vq  = Yj  = diag(vi Vp),  an  unknown  diagonal  matrix  with  ith  diagonal  clement  Vj  > 0, 

let  2 = Yu  - diagfvt vp),  where  5i is  (n>(2K(m-l)+2)r‘  rimes  the  ith  diagonal 

element  of  { X X .1  (Xuj-XW)(Xhij-XW)Tl- this  case.  Xu  I Shi'  Np(0u.  m'Yd)  and 

vj  - v;[m(2K(m- 1 )+2)]‘ 1 XaK(m- 1 ) independently.  Finally,  if  Yo  « complete  unknown,  then 
define  2 = [m(2K(m-l)+p+l)l"'( ^X  j;  £ (Xuj-XuXXuj'Xu)T)-  Now.  Xu  ifihi  ” 
Np(fiw,  m"‘Yo)  and  Y - [m(2K(m-l)+pt-l)]"‘lV(p.  2K(m-l),  Yo)- 1"  each  of  above  three 
cases,  the  minimal  sufficient  statistic  for  <0,.  fc,  Y)  is  (Xt.  X2.  2).  where  Y = tn'Yo- 
Note  that  conditional  on  0 = (Si.  02>.  Xu's  and  2 are  independent,  and  the  distribution  of 
2 does  not  depend  on  0!  and  02.  Also  marginally,  (Z.  S.  2)  is  the  complete  sufficient 

statistic  for  (u,  £.  Y).  where  Z = <Zi Zk).  Zi  - (Xu+XjiV2  - Np(Uj,  2"'B,  Xi  = 

2(Xti-Zi)  - Np(a  25),  and  & = 2"1  <?  Y-Yj)  - lV(p.  K.  S).  Hence,  the  UMVUE  of  B is 

Z.  while  the  best  multiple  estimator  of  and  Y are  (K-p-l)S'1  and  2.  respectively. 
Substituting  these  estimators  for  u.  £,  and  Y in  (4.1.6),  one  gets  the  EB  estimator  of  0|  as 


ap-(K-p-i)2s‘,)Xi+(K-p-i)2s‘lz- 


(5.1.5) 


More  generally,  we  consider  the  class  of  EB  estimators 

afix,  2)  - a,-  2s’1i(2,s)>Xi+2s',a2.  vz,  cs.i.o 

where  1(2.  S)  is  a pxp  symmetric  matrix. 

In  Section  5.2,  Theorem  5.2.1  connects  the  risks  of  g^8  and  Xi,  and  it  shows  that 
the  methods  of  finding  EB  estimators  in  Section  3.2-3.4  can  be  adapted  here  for  finding  EB 
estimators  of  g,  under  three  different  assumptions  about  V defined  in  (5.1.1)-(5.1.3), 
respectively.  Again,  the  EB  estimators  proposed  in  Section  5.2  are  weighted  averages  of 
the  pooled  sample  mean  and  the  first  sample  mean.  The  weghts  intend  to  reflect  the  degree 
of  evidence  for  or  against  the  null  hypothesis.  Unlike  the  PTE,  which  selects  either  the 
pooled  sample  mean  or  the  first  sample  mean,  the  EB  estimator  achieves  a compromise 
between  the  two.  It  is  shown  in  Section  5.3  that  corresponding  to  every  EB  minimax 
estimator,  there  exists  a corresponding  modified  EB  miniraax  estimator  which  dominates 
the  PTE.  The  modified  EB  minimax  estimator  like  the  PTE  is  dichotomous.  Both  these 
estimators  equal  the  pooled  sample  mean  when  the  null  hypothesis  is  accepted  at  a desired 
level  of  significance.  However,  if  the  opposite  is  the  case,  while  the  PTE  equals  first 
sample  mean,  the  modified  EB  minimax  estimator  equals  the  corresponding  EB  minimax 


In  the  next  two  sections,  we  denote  by  Eg^,  the  expectation  conditional  on  g = 
(fit  - 02>  and  Y,  E the  expectation  over  the  marginal  distribution  ofg,  and  E the  expectation 
over  the  joint  distribution  of  X = Q£i , Xq).  S and  Y-  The  following  theorem  provide  the 


risk  of  fi®(  2,  X)  defined  in  (5. 1 .6)  under  the  Lq-  loss  without  specifying  the  assumption 
regarding  the  structure  of  the  unknown  V.  We  only  need  the  independence  of  V and  X = 
(Xt.Xj)  to  prove  the  result. 


Theorem  5.2.1  The  risk  Rg*(fi|,  fif8)  under  the  loss  given  in  (5.1.4)  is  given  by 


Rg-ffli.af) 

= Rg-<2i.  Xi)+2EfiY.ir(£'1i(y.S)Si(2,  as'vQ'v 
-4I&SKY.  sia'iSo'y-  2<k-p-i)k2,  sis'yg'y),  (s.2.i> 

where  Rg-(fl,.  Xt)  “ Ktr(yQ*)  = Ktr(Q). 

Proof  of  Theorem  5.2.1  Let  E?j(2,  X)  = (l,-yX‘')Xl,+y  I 'lL  and  9^(2.  X)  - 
dp-  ya''l(V,  £))Xii+y  S‘‘l(y.S)Zi.  > = 1 K.  Standard  calculation  yields 


ELg-(9,.9f) 

= ELq*(9i,  5f)  + E(  X (ftf,-sff)TQ*  <Sf|  -fiff) ! • (5.2-2) 

a^ffli.af)-  Ktridp-yjr'jys'i.  (5.2.3) 


(ys''i(y.  s^yr'xxii-ziJ-yr'ai-ui) 

= ^(ys'li(y.s)-ys'1)ii-yr1(ZrUi)-  (5 


5.2.4) 


of  Zj  and  dj,  2)  and  the  fact  that  marginally  Zi  - 


Np(ui,2-1S),  one  gets 

- E(i(zi-Ui)T2r'i:Q'y£-1(Zi-Ui) 

+j.I3;t(25'ii(2,  s)-yjr‘)TQ‘(y  ff'idf.  si-yr1)^) 

- Etr(<  I (ZrttX&ttl^'y Q'yjr1 

+j<y  a'ny.  £D-ys‘)<5  ii*T)GT<y.  sis'y-jr'yjQ*} 

= |>rtr1yQ'y}^Etr((YS''i(y,S)-y2r')SGT(y,sjs-1y-r,y)Q‘). (5.2.5) 

5&r|(ys’li(y.s)-ys-1)SGT(y.a)S''y-2r,y)Q’l 
= jEiris-'Ky.  s)Si<y.  s)S‘'y  o'y  -TJr'ssy.  ssr'y  Q‘y) 

+fir(Z'lyQ'y),  (5.2.6) 

by  die  fact  E(S)  = K£  and  using  the  same  algebra  to  get  (3.1.5). 

From  (5.2.3),  (5.2.5)  and  (S.2.6),  it  follows  that  the  right-hand  side  of  (5.2.2)  is 

Kn<yQ*)4^Etr(s'li(2.  s)si(y.  s)a-1y  s'y  -2s-‘si<y.  s)S‘‘y  Q'y).  (5.2.7) 

Since  V is  independent  of  X = (Xt,  &)  and  S,  we  apply  the  Wishart  identity  as  given  in 
Lemma  1.3.3  again  to  ElZ'Sfiy,  S)S'')- Thus  EL^-ffl,.  fif8)  can  be  expressed  as 


102 


ERg-ffli.If) 

= ERg-(fii,  Xl)+jEEgytr(S'1I(2,  S)Sl(2.  S)S’'2  q‘2 

-jiAskv,  sja-'jyg’y-  2(K-p-i)i<y,  s)£-'yQ‘y),  (5.2.8) 

where  Rg-(fi,.  X,)  = Ktr(y.Q')  for  all  a,.  Now  the  conditional  dismbuiion  of  £ = 
2'1 2 Y.yT  given  g depends  on  a only  through  2‘1£(fl11-fla)(flli-fl2i)T  = S‘(say)  and  2 

is  ancillaty.  Also  the  family  of  distributions  of  Q being  Wishart  lV(p,  K,  A)  is  complete  for 
A (or  £).  Thus.  (5.2.8)  holds  even  when  the  E expectation  is  removed.  The  proof  of  the 
theorem  is  complete. 

Next  observe  the  analogy  between  (3.1.3)  and  (5.2.1).  As  in  Chapter  3,  we  have. 

fifB<l>(2.  X)  = (Ip-r(2.  S>2  S‘*)Xi+r(2.  S)2s'z.  (5.2.9) 

where  t<2.  S)  (>  0)  is  a scalar,  and 

5^(2.  X)  = (Ip- 2 S‘  'aKS))X  1 + 2 ‘sefSJZ.  <5.2.10) 

where  a(S)  is  a pxp  symmetric  p.d.  matrix  and  is  of  the  fotm  EM1,  where  £(i)isa 
diagonal  matrix  with  ith  diagonal  clement  equal  to  M'i(i)>0,  i = 1, ....  p;  t = (Xi, ..., 

vectors  equal  to  the  orthonormal  eigen-vectors  of  S.  that  is,  S = E AET  and  A = diag(X|. 

....  Xp). 

Also  can  be  improved  by 


'’(2.X) 
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= ap-i<S.a)5s'l-2a1£<a))Xi+(r<2.a)ya'l+2s‘lr(S))2.  csaii) 

where  £(£)  is  also  of  ihe  form  E5i(i)ET  defined  as  above.  The  dominance  holds  for 
certain  choice  of  r(2.  S)  and  !E(D  which  vary  with  the  assumptions  on  V,  as  shown  in 
Section  3.2-3.4. 


Remark  5.2.1  We  can  extend  the  number  of  groups  of  sample  matrix  normal  means  to  H (> 
2)  and  make  the  same  adjustments  for  Z,  and  £ shown  in  Remark  4.2.1.  In  this  case,  the 
difference  of  risks  of  gf8  and  J£|  is 

(H-l)H-|E8vtr(£‘1«2.  £)£l(2.  S)S''2  Q*2-4I^ISi(2.  s)s']2q'y 
-2(K-p-1)S(2.S)S‘'2q'Y.|.  (5-2-12) 


5,3  Improvement  Qn.KIE 

When  the  covariance  matrix  V of  Xm’s  is  unknown,  a general  PTE  is  given  by 
8^(2,  X)  = g(C")2£|+(l-g(C’))Z 

= Op-d-glC'MyXt+d-glc’BZ.  (5.3.1) 

function.  The  choice  of  d is  governed  by  the  level  of  significance  dial  is  used  for  testing  the 
null  hypothesis  of  no  difference  between  group  means,  that  is.  Ho  : fit  = fc-  The  statistic 
C*  = c’(2.  £)  is  a scalar  function  of  2 and  £.  For  example,  when  V is  complete 
unknown,  it  can  be  Wilk’s  Lambda  (iyi/IY+£l),  Lawley-Hotelling  Trace  statistic  trf  V £-1), 
largest  eigen-value  of  (2  S'1 ).  etc.  To  incorporate  the  degree  of  evidence  for  or  against  the 
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null  hypothesis  that  PTE  fails  to  do,  we  propose  the  modified  vetsion  of  EB  estimators  fi® 
defined  in  (5.1.6)  as 

1^(2.  20  = g(C>af-Ki-g(c'))z 

= (Ip-IO-SfC’tV8*0’^  S)1  )2Ci 

+[<l-g(C'>)lp+g(C,)2  S''lo<2.  S)]Z.  (5-3.2) 

where  So< 5.  £)  * a pxp  symmetric  p.d.  matrix.  Note  that  gf®8  is  a weighted  average  of 
2£,  and  Z with  extremes  Z and  ft™  when  g(C’)  = 0 and  1 . respectively. 

The  relationship  between  the  risks  of  a PTE  gf18  and  a modified  EB  estimator 


Theorem  5.3.1  The  risk  difference  of  If®8  and  gf18  under  the  L<j*  loss  is  given  by 

Rg.(g..fif8)«5-(g|.arrE) 

= jEfli,tr(g(c-)is-|io(5.s)Sio(2.S)a'l2Q'2 
-4d5(sio(2.  s)S''i2q‘y-  2(k-p-i)io(2.  S)S‘‘2  Q’y]  )■  (5.3.3) 

Proof.of  Theorem  5,3.1  Le.fr&S)  - (l-g(C,))s2'1  andu&S)  = (i-g(C'))s2'‘ 
+g(C')Io(2.  S)  = Ii(2.  S)+g(C’)Io(2.  S).  Then  we  rewrite  gf18^.  20  - (Ip-  2^' 
ii(2.S))x,+2£-'ii(2.  sjz  and  g^(2.  x)  - a,,-  2s’‘ij(2.  £))2£i+2s''i2<2, 

S)Z  Note  that  Io(2.  S)  is  symmetric  but  I,  (2.  S)  and  12(2.  £)  are  noL  Then  let  gfpfV, 

20  = (ip-2s''ii(2.  s))2tii+2£’‘ii(2.  sj^i  and  g^EB(2, 20  = ap-2a'lu(2, 
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mii+ya-'fetS.  S)Zj-  Then,  applying  (5.2.2)  and  (5.2.5)  twice,  once  with  1(2,  S)  = 
1,(2.  S).  and  next  with  fi2,  S) = fc(2.  S).  we  have 

ELg-ffli.  5^) 

= *Etr([(2s‘'i2(2.  S)-Y£'')S(jJ(2. asr'Y-r'Y) 

-(2  s-'ii(2.  a-Yr'jsttl <2,  S)S'‘2-s'y))q'  ) 

= |Etr{[g2(C")2S',Io(2.S)&Io(2.S)£'l2 
+g(C)2s-|i,(2.  s)£si(2.  S)£',2+g(c’)2£',i0CY.  S)Si{(2.  S)S''2 

-g(C")2  S''lo(2.  S)£S'1Y-g(C")Y£‘1£l0(2.  £)£''  2]Q’  ) . (5-3.4) 

since  Ij(2.  S)  = 1,(2.  SHg(C")io(2.  S). 

Next  using  the  facts.  g(C')(l-g(C*)) =0.  g2(C")  = g(C'),  and  tr(Y£'lj0(2.  £)£ 
Z'YQ')  = tr(Y2‘,SS)<Y.  S)£''2q")  = tt(Z'1£l®(2.  S)5''2q‘  Y),  one  gets 

EL^fflt.  af^-EL^ta,.  af1^ 

= iEtr(g(c‘)a-'io(2,  a)aio<2.  S)S''2  o'  2 

+22rlg(C')£i0(2.  £)£‘‘2  Q’  Y).  (5-3-5) 

Recall  2 is  independent  of  X and  S-  By  using  the  Wishan  identity  as  given  in  Lemma 
1.3.3  along  with  ag(C')/3s,j  = 0 a.e.  and  12s  g(C*)  = (Og(C")/3sij))  = 0.  the  following 
can  be  obtained. 


E(r,g(C')£io(2,  £)£•') 
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= El^  [g(C')Slo(5,  S)S'l]+<K-p-l)£'l[g(C’)£lo(9,  SIS-1]  ] 

= EI2IE5  g(c')isa(2,  sis'1+2((i(c')ip)Ta2s)T]T[Sio<2.  S)S‘'i 
■KK-p-l)g(C*)lo(2,  S)S_I  I 

= E(2g(C')CslSl0(y.S)S'll+(K-p-l)g(C')Io<2,£)S‘1].  (5.3.6) 

Combining  (5.3.5)  and  (5.3.6),  one  gels 

EI^' (fi, , S^-ELg-lfi, . af1®) 

= EtEjyLQ-iai.a^-Eaj-Lq.taLar®)) 

= i EE2yir(  g(C')[S'lJ0(5,  S)Slo(Y,  SJS'y  Q'y 

-4D5isio(y.  S)a-')S  q'y-  2(k-p-djo(v,  asr'S  q'jy]).  (5.3.7> 

Then  applying  the  completeness  argument  as  in  the  proof  of  Theorem  5.2.1  and  the 
ancillarity  of  V,  one  gets  (5.3.3)  from  (5.3.7)  directly.  The  proof  is  complete. 

In  view  of  Theorem  5.2.1  and  Theorem  5.3.1,  unbiased  estimators  of  the  risk 
differences  of  5®  and  2tj  . and  fip8  and  flf1®  are  different  only  in  a multiplier  g(C*). 
Note  that  Pr(g(C*)  > 0)  = Prfg(C’)  = 1)  > 0.  Hence  under  the  Lq*  loss,  given  an  EB 
minimax  estimator  aP  and  a PTE  af1"8  of  Bi.  we  can  always  produce  a modified  EB 
estimator  Bp8  which  dominates  Sf18  under  the  same  conditions  where  an  EB  minimax 
estimator af  dominates  X,.  Thus,  ail  the  modified  EB  estimators  if®8®.  Sf®8®  and 
Sf®8®  of  EB  minimax  estimator  Sf8®.  If®  and  SP®  as  given  in  (5.2.9M5.2.11) 
respectively  dominate  the  PTE  under  the  Lg*  loss.  Note  that 
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fl{®*»(2.20 

- (ip-ia-gfc'Dip+gic'wy, S)5s4iiXi*{(i-g(c'))ip+g(c-)r(2,  siSs-'iz. 

(5.3.8) 

Sf^cy.X) 

- Hp-KI-gCC'illp+glC'liS-'alSJlIXi^Kl-glc'Dlp+glc'jyS-'sKSllZ.  (5.3.9) 

^EBO)(2.2D 

= (lp-l< 1 -BCc*))Ip+g<c*)r(  y,  aya-'+gCC’^S-'KSMlXi 
•K(l-g(C‘))l]1+g(C’)r(2.S)SS‘1+g(C')2S'l£(S)lZ. 


(5.3.1 


CHAPTER6 

SUMMARY  AND  FUTURE  RESEARCH 


6.1  Summary 

This  study,  as  noted  in  the  previous  chapters,  demonstrates  Stein’s  phenomenon  in 
the  problem  of  estimation  of  matrix  norma!  means.  The  estimators  dominating  the  sample 
mean  are  obtained  by  EB  methods  which  use  information  from  similar  sources  in 

"borrow  strength  from  the  ensemble". 

The  proposed  EB  estimators  are  compared  in  term  of  risks  under  the  quadratic  loss 
with  a p.d.  weight  matrix.  It  should  be  noted  that  we  only  concentrate  on  estimators  do  not 
depend  on  the  weight  matrix  being  used.  In  this  way,  the  proposed  EB  estimators  enjoy  a 
great  degree  of  robustness. 

and  3.  Combined  with  the  quadratic  loss  being  considered,  these  two  chapters  cover  all  the 

In  Chapters  4 and  5,  the  problem  of  achieving  a compromise  between  the  first 
sample  mean  and  pooled  sample  mean  is  solved  by  the  proposed  three  classes  of  modified 
EB  minimax  estimators  for  both  known  and  unknown  covariance  matrices.  Each  modified 

sample  mean,  and  dominates  the  PTE  in  terms  of  risks  under  the  quadratic  loss.  Such 
estimators  are  modified  EB  minimax  estimators  which  have  uniformly  smaller  risks  than 


the  first  sample  ■ 
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There  are  many  directions  in  which  to  pursue  future  research,  staying  in  the 

matrix  for  comparing  the  performance  of  some  estimator  in  each  column  of  the  matrix 
normal  means,  one  can  use  different  weight  matrices  for  each  of  them.  We  can  also  extend 
the  results  to  multivariate  regression  models,  for  example  generalizing  the  work  of  Ghosh. 
Lee  and  Littell  (1988).  Also,  we  propose  to  undertake  the  study  of  EB  minimax  estimators 
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